Moedas’s legacy —
and what Europe 
must do next 


The successor to the European Union’s research
chief must act to prevent budget cuts. 


Carlos Moedas was little known outside Portugal
when he took over as the European Union’s 
research and innovation chief in 2014. 
Now, at the end of his tenure, that is no longer
the case. In five years, the engineer-turned-banker-turned-politician
has demonstrated thoughtful
advocacy for research. He has listened to researchers and 
delivered — except on one issue where it really matters. 

The funding settlement for Horizon Europe, the next 
research framework programme for all EU member states, 
has hit a roadblock. Moedas’s successor, Mariya Gabriel, 
and Europe as a whole must work hard to fight cuts and 
potential delays to its start. 

On the positive side of the ledger, it is because of Moedas 
that around €9 billion (US$10 billion) — around one-tenth 
of the next round of European research funding — will be 
set aside for large collaborations in five global challenges or 
‘missions’ — in climate change, cancer, oceans, smart cities,
and soil and food. This was an idea that Moedas adopted 
after discussions with researchers, notably the innovation 
economist Mariana Mazzucato. 

But a European research commissioner’s core job — 
some would argue the most important one — is to protect 
the budget. Earlier this month, negotiations between EU 
member states on the next seven-year budget cycle (for 
2021-27) stalled. The European Commission is asking for 
€1.135 trillion, including around €100 billion for research. 
Member states want to cut the total budget by between 
€35 billion and €85 billion. Facing such a shortfall, it isn’t 
uncommon for those in charge of setting budgets to look 
to research for cuts. 

Protecting research needs firepower — it requires support
from heads of government, and especially from national 
ministries of finance. Moedas and his boss, commission 
president Jean-Claude Juncker, should have assembled
high-level support much earlier, before we got to this point. The
responsibility for ensuring that research does not bear the 
brunt of any cuts now falls to Gabriel. 

An added complication is that, under the incoming 
commission, the department for research and innovation 
is being merged with that for education, youth, sport and 
culture. This expanded department is called Innovation and
Youth — ‘research’ has been lost from the title — and Gabriel 
will have extra, and possibly competing, priorities, one of 
which is a trebling of the budget for the student-exchange
programme Erasmus+.


Moedas has been a popular commissioner, known as a 
team player and a conciliator — playing the ‘good cop’ to 
his former head of research Robert-Jan Smits’s ‘bad cop’ in 
budget discussions. Smits describes Moedas as “a genuine, 
nice person who doesn’t like to put people in an uncomfortable
situation”. These are important qualities.

But the EU faces some significant challenges, and 
Gabriel will need to adopt a tougher persona. Economies 
are slowing; austerity has been painful and many governments
want to spend more at home on social programmes.
At the same time, budget planners will need to adjust for the
potential absence of — or reduction in — the UK contribution
to the EU.

If they want to see their EU research budgets protected, 
research organizations can help Gabriel by putting pressure 
on their national governments, especially finance ministries.
Everyone needs to push harder to protect funding — so that 
the spirit and support that has helped make the EU a model 
for collaborative research can live on. 


Brexit promises 
are premature 


Government offers of new funds for 
UK scientists could be unaffordable. 


There’s a research group in Britain that has become
a staple of the country’s news shows, and it’s
called The UK in a Changing Europe. On most
nights, the team of political scientists, economists
and lawyers dispassionately responds to
broadcasters’ questions on the impact — economic, political
and societal — of the United Kingdom’s departure from the 
European Union. 

The researchers, who are funded by the UK government’s 
Economic and Social Research Council — but whose work is 
independent of the government’s own policies — do not have
an easy task. But it’s an important one, in part because the 
government has not yet released its own detailed analysis of 
Brexit’s impacts. 

Lawmakers know that most researchers would like 
nothing more than for the United Kingdom to remain a
member of the EU. That is one reason that The UK in a Changing
Europe team, which is one of just a handful of independent
analysts, is careful not to dwell on the impact of Brexit
on the research community — but instead is keeping the 
focus on the bigger picture. 

As this Editorial went to press, the EU had agreed to 
a request from the UK government to delay Brexit to 
31 January 2020 — three months beyond the recent, 31 October,
deadline. And with Prime Minister Boris Johnson and
members of the Parliament at loggerheads over the terms 
of the exit, politicians were preparing for a general election. 
Researchers will have breathed a sigh of relief at avoiding an 


Nature | Vol 574 | 31 October 2019 | 597 


© 2019 Springer Nature Limited. All rights reserved. 


Editorials 




October ‘no-deal’, but few will be rejoicing. In what the EU 
calls a “flextension”, Brexit could happen before 31 January
if Parliament approves a deal. 

And if a deal such as the one Johnson and the EU have 
agreed is ultimately passed, the worst case is that Britain 
leaves the free-trade area known as the customs union. Free 
movement of citizens to and from the EU and Britain will end,
and Britain’s researchers might no longer be able to obtain 
funding from certain EU research programmes. 

That is the scenario policymakers are planning for. But 
as this journal — along with organizations representing 
researchers, such as the Royal Society — has repeatedly 
said, fracturing more than four decades of joint working 
between the United Kingdom and its nearest neighbours 
will damage both science and society. 

Aware of these concerns — and especially of the need to 
maintain scientific connections — the Johnson government 
has been talking up post-Brexit wins for research. 

It plans a more favourable visa regime for students and 
researchers, and is shaping new funds, including a UK 
version of the United States’ Defense Advanced Research 
Projects Agency. There’s also talk of a generous European
Research Council-style fund for UK researchers, should 
access to the EU scheme no longer be possible, and more 
funds to collaborate with the United States. And there’s 
confidence among some policymakers that the world’s 
researchers will continue to want to work with, and in, 
Britain. 

Such confidence is premature. A more welcoming visa 
regime and extra funding will help to placate some of 
researchers’ concerns, but new cash depends on how much 
the UK Treasury department has to spend, and that relies 
on two things that the country does not control. The first
is how much Britain will have to pay the EU for any future 
relationship. The second — and more important — factor 
is that any funding increase for research needs the UK 
economy to continue to grow. Although the Treasury has 
carried out detailed economic-impact analyses of future 
growth, the chancellor of the exchequer, Sajid Javid, is not 
yet releasing the results. 

But thanks to modelling from UK in a Changing Europe,
we know that, under Johnson’s proposals, income per 
capita is projected to be 2.5% lower on average than if Brit- 
ain remained an EU member, based on economists’ projec- 
tions of income from trade and reduced immigration. The 
team also says that when trade falls, which it will in the initial 
period after Brexit, that also reduces productivity. After 
factoring productivity losses into the models, post-Brexit 
income per capita could be between 2.3% and 7% lower. 

These figures call into question assumptions that Brexit 
will bring an economic dividend. And without such a 
dividend, the government will probably struggle to keep 
its promises of increased research funding. 

As the Brexit saga rolls on, researchers need to continue 
their objective analyses of its potential impacts, and to call 
out what could be prematurely optimistic promises. They 
must highlight the risks to research and ensure that none 
of these issues is trampled on in the stampede to get a deal
in place. 


Open data could 
save more tigers


India has a duty to give researchers access to 
the raw data on this threatened species. 


On Global Tiger Day in July, India’s government
announced a victory. It declared that the
nation is home to 2,967 wild tigers — a major
increase on the 1,872 animals recorded in 1972.
Centuries ago, tens of thousands of tigers
roamed the world. Today, only six sub-species remain and
the International Union for Conservation of Nature esti- 
mates that there are no more than 3,159 individuals in the 
wild. But, after centuries of hunting and habitat destruction, 
India’s tigers seem to be turning a corner. However, as we 
report on page 612, there’s much more that could be done. 

To begin with, tiger-conservation work must be improved 
in more of India’s 50 tiger reserves — the current effort is 
concentrated in just a handful. And the government must
give scientists at India’s universities access to the reserves 
and the raw data on which its tiger estimates are based. 

At present, scientists cannot access the full data that the 
government collects during the national tiger census, which 
is conducted every four years. This is in spite of the fact that 
India’s National Data Sharing and Accessibility Policy states 
that scientific data collected using “public funds” should be 
made available to those with a legitimate research interest. 

If researchers were allowed to see the raw numbers, they 
could independently verify tiger population estimates. Such
verification is essential to knowing whether conservation 
measures are working. It could also allow scientists to pro- 
ject population trends over time, and estimate birth and 
death rates. These measures would help officials in the 
government forestry department to assess populations 
more accurately and act quickly if they foresee a risk of local 
extinction. Such action might have helped to prevent the 
complete loss of tigers from three reserves that the 2018 
census reports. 

Researchers are also keen to get involved in, and improve, 
the census itself. The four-yearly survey is a gargantuan 
effort. The 2018 one covered 381,400 square kilometres and 
amounted to nearly 594,000 human-days of work. It logged 
35 million photographs taken with hidden motion-triggered 
cameras, yielding almost 77,000 images of tigers. 

But instead of trying to count tigers across such a vast 
area, individual reserves — where 70-85% of India’s tigers 
are thought to be found — could be sampled more often, and
automatic image recognition used to process the pictures. 

The slow growth of India’s tiger population is a rare good
news story in international conservation. But the Indian government
could be doing more. It must trust independent
scientists with the raw data, so that one of Earth’s most iconic 
species can survive long into the future. 
A personal take on science and society


World view 


By Tracey Bretag 


Contract cheating will 
erode trust in science 


To combat academic dishonesty, focus on
educational systems and not just individual
offenders, says Tracey Bretag.


Stories of students paying others to do their work
come from all around the world. In the 2015
MyMaster scandal in Australia, hundreds of students
who were enrolled in more than a dozen
universities paid a total of at least Aus$160,000
(US$108,000) to a ‘service’ that provided ghost-written 
essays and responses to online tests. In 2018, YouTube stars 
on more than 250 channels received money for promoting
a cheating service called EduBirdie. Similar companies have
been uncovered in the United States and elsewhere. Scientists
should not deceive themselves: they are not immune.
Academics call this ‘contract cheating’. My colleagues 
and I have assembled what is, to our knowledge, the largest
data set on the topic — with responses from some 14,000 
students and 1,000 teachers across 8 Australian universities. 
We found that roughly 6% of students have engaged in the 
practice; that most who cheat do so more than once; and that
both post- and undergraduate students engage in it. Cheat- 
ing is not new, but the proliferation of commercial, online 
services in the past 5-10 years has made it easier than ever.
And cheating is becoming increasingly normal. Since the 
1990s, universities around the world have reimagined them- 
selves as commercial enterprises that promote educational 
‘products’ to student ‘consumers’. In 2017, a commentator 
likened the brash marketing strategies of some UK univer- 
sities to the advertising of shampoo, and hundreds of aca- 
demic papers have openly criticized the ‘marketization’ of 
higher education. It’s no wonder students opt to take the 
most convenient route to an academic credential — just as 
they would shop around for any other deal. In our survey, 
more than one-third of teachers specifically blamed contract
cheating on the commercialization of higher education. 
Data from student surveys uncovered three factors 
associated with contract cheating: speaking English as a 
second language; thinking that there are lots of opportu- 
nities to cheat; and dissatisfaction with the educational 
environment. Two trends — dwindling teaching resources 
and lower linguistic and academic standards for admission 
— contribute to the situation. And although little research 
has been done on the frequency of this phenomenon in scientific
researchers, those driven to outsource their written
work as undergraduates will probably be tempted to do so
as academics under pressure to ‘publish or perish’.
A cursory Google search for ‘ghost-writing services for
researchers’ identifies thousands of services offering complete
dissertations, grant applications, conference papers
and journal articles. Outfits selling authorship on research
publications have been uncovered in China and Iran, and
the market for ghost-written PhD dissertations is reportedly
booming in Ukraine and seems healthy in Australia.

Tracey Bretag is an associate professor in the School of
Management at the University of South Australia in Adelaide.
e-mail: tracey.bretag@unisa.edu.au

We need to recognize that contract cheating is not just 
the responsibility of individual students, teachers or institutions.
It is a systemic issue. Government funding agencies,
regulatory authorities and leaders in higher education must 
tackle it. 

Some have made a good start. In 2018, New Zealand 
successfully prosecuted commercial cheating service 
Assignments4u, which paid NZ$2.1 million (US$1.3 million) 
in an out-of-court settlement and closed down. In April 2019,
the Australian Department of Education introduced a draft 
bill targeting commercial services that advertise or pro- 
vide unauthorized assistance to students. It is expected to 
become law next year. Ireland has a similar law on its books.
Such laws send a clear message. Cheating is not just unethical,
it is illegal — and it has consequences. Laws hold accountable
the stakeholders that are essential for contract cheating
over which educational institutions have no control. 

I feel that another important strategy is to reduce 
demand from within. Our team has found shockingly lit- 
tle concern in academics, including cheating students, 
non-cheating students and senior decision makers. They 
think that cheaters are just hurting themselves and are not 
damaging the community. 

A radical shift in rhetoric would help individuals see 
the value of actually doing their work. Institutions need 
to stop treating education as a product and refrain from 
determining the value of research by the amount of fund- 
ing received or the number of papers produced. Instead, 
they should focus on building academic cultures that are 
committed to integrity and that place abiding faith in the 
value of knowledge creation. 

We cannot simply tell people not to cheat. We must pro- 
vide support so that students feel capable of completing 
their assignments. That includes ensuring that institutions 
have appropriate language requirements for admission and 
allocating appropriate resources for teaching and learning. 

Contract cheating is also a threat to public safety. It is not
difficult to imagine how doctors, engineers and social work- 
ers who have outsourced their learning could pose a risk. 
This practice even threatens the common understanding of 
scientific facts — a big concern in the ‘fake news’ age. A large 
number of researchers purchasing their theses, publications 
and qualifications would endanger the credibility of science. 

To defend itself, the scientific community must rec- 
ognize that contract cheating is not an isolated problem 
caused by ‘bad apples’. It is an attack on core academic 
values that necessitates stronger leadership from govern- 
ment departments, funders, regulators and educational 
institutions. This threat requires a collective response. 
The world this week


News in brief 


TRUMP REVIVES 
WHITE HOUSE
SCIENCE COUNCIL 


US President Donald Trump 

has resurrected the President’s 
Council of Advisors on Science 
and Technology (PCAST), 

more than two years after he 
took office. Trump signed an 
executive order on 22 October 
appointing seven members to 
the panel. All but one come from 
industry, rather than academia. 
The lone university researcher 
on the panel so far is Birgitta
Whaley, director of the Quantum 
Information and Computer 
Center at the University of 
California, Berkeley. 

The four other members 
of the panel who have PhDs 
in scientific fields include 
H. Fisk Johnson, the chief 
executive of cleaning-products 
firm S. C. Johnson in Racine, 
Wisconsin; Dario Gil, director 
of IBM Research in Yorktown 
Heights, New York; Sharon 
Hrynkow, senior vice-president 
for medical affairs at Cyclo 
Therapeutics in Gainesville, 
Florida; and A. N. Sreeram, 
chief technology officer for the 
chemical firm Dow in Midland, 
Michigan. 

The remaining members of 
the panel are Catherine Bessant, 
chief technology officer at 
Bank of America in Charlotte, 
North Carolina, and Shane 
Wall, chief technology officer 
at Hewlett Packard and global 
head of HP Labs in Vancouver, 
Washington. 

PCAST will be led by Kelvin 
Droegemeier, director of the 
White House Office of Science 
and Technology Policy and 
Trump’s science adviser. 
Fresh push
for ‘failed’
Alzheimer’s
drug


[Chart: Biogen share price (US$). Annotations: aducanumab shelved after trial failure; Biogen announces plan to seek regulatory approval.]

A drug for treating Alzheimer’s disease that the 
biotechnology company Biogen left for dead in March 
may get another chance to prove itself. 

Biogen, in Cambridge, Massachusetts, announced 
on 22 October that it would seek approval from the 
US Food and Drug Administration (FDA) for its drug 
aducanumab to treat early-stage Alzheimer’s disease. 
Biogen’s share price, which plummeted in March
following news that the drug had failed, shot up again 
after the latest announcement. 

The company had halted development of 
aducanumab, an antibody that targets deposits of 
amyloid-β protein in the brain, because an early
analysis of clinical-trial results suggested that it had 
no significant effects on clinical symptoms such as 
memory loss and disorientation. But Biogen has since 
evaluated new data from the same studies. These 
showed that, for a subset of patients, high doses of 
aducanumab given for an extended period significantly 
slowed cognitive decline. 

Scientists caution that FDA approval is not guaranteed. 


SOURCE: NASDAQ 
WANTED: TOUGHER
LAWS TO PROTECT 
AUSTRALIAN SPECIES 


More than 240 conservation 
scientists have called on the
Australian government to 
strengthen the country’s 
environment-protection laws 
to halt habitat destruction and 
species extinctions. 

In an open letter to Prime 
Minister Scott Morrison, the 
scientists wrote that Australia is 
in the middle of an “extinction
crisis”, citing evidence that 
in the past two centuries the 
continent has lost more than 
10% of its 273 native land- 
mammal species. Two mammal 
and one reptile species have 
died out in the past decade. In
addition, more than 1,800 plant
and animal species are listed as 
threatened with extinction. 

The scientists’ call comes 
as a review of the national 
environment legislation starts 
this week. The Environment 
Protection and Biodiversity 
Conservation (EPBC) Act, 
established in 1999, must be 
reviewed at least once a decade. 

“Our current laws are failing 
because they are too weak, have 
inadequate review and approval 
processes, and are not overseen 
by an effective compliance 
regime,” the scientists wrote. 
Since 2000, more than 7 million 
hectares of habitat that could 
have housed threatened species 
has been wiped out. 

Environment minister Sussan 
Ley said that the review of 
the EPBC Act will encourage 
submissions, such as the
letter, as well as input from 
agricultural and industrial 
perspectives. “The government 
is investing significantly in 
environmental restoration and 
land care programs to promote 
biodiversity and safe havens for 
our native species,” Ley said. 


CANADIAN KIDS SUE
GOVERNMENT OVER 
CLIMATE CHANGE 


A group of children and
young adults filed a lawsuit
on 25 October alleging that
the Canadian government has
violated their constitutional
rights by promoting
and enabling fossil-fuel
development in spite of
acknowledged risks from global
warming.

Fifteen people aged between 
10 and 19 filed the lawsuit in 
federal court, arguing that 
climate change will impinge 
on their right to “life, liberty 
and security”. The lawsuit also 
argues that climate change will 
interfere with basic equality 
rights, given that the most 
severe effects of climate 
change will be borne by future 
generations. 

“The courts must hold this 
generation to account for harms 
that are being done to the next,” 
says Chris Tollefson, co-counsel 
for the plaintiffs and a specialist 
in environmental law at the 
University of Victoria in Canada. 

Ira Reinhart-Smith, a 
15-year-old plaintiff from 
Caledonia, Canada, got involved 
with climate activism last year. 
“My generation and generations 
to come are going to be exposed 
to things that the world has 
never been exposed to before,” 
he says. 

The lawsuit is the latest in a
series of legal challenges aiming 
to force governments around 
the world to act on climate 
change. 


STARCRAFT II 
AI OUTPLAYS
TOP-RATED GAMERS 


Players of the video game 
StarCraft II faced an unusual 
opponent this summer. An
AI called AlphaStar — built by
Google’s artificial intelligence
firm DeepMind — was unleashed
on the game’s European servers,
where it achieved a grandmaster
rating, placing it within the top
0.15% of the region’s 90,000
players. This result, reported
on 29 October (O. Vinyals et al.
Nature https://doi.org/10.1038/s41586-019-1724-z; 2019),
shows for the first time that an
AI can play at the highest levels
of StarCraft II, which poses
different challenges from other
games AIs have aced.

StarCraft II players wage 
futuristic warfare by controlling 
human and alien armies in 
real time. Experienced players 
multitask by managing 
resources, executing complex 
combat manoeuvres and 
out-strategizing their opponent. 

AlphaStar uses similar 
machine-learning techniques 
to AIs that DeepMind built to
play chess and Go. They rely on
neural networks, which learn to
recognize patterns, and could
be applied to other AI problems,
such as self-driving cars.

AlphaStar crushed low- 
ranking opponents and amassed 
61 wins out of 90 games against 
top-rated players, although 
it wasn’t able to beat the best 
human player in the world. 


SNIPERS IN SOUTH
KOREA TARGET BOARS 
WITH DEADLY VIRUS 


South Korea has mobilized 
military snipers and drones 
along the demilitarized zone 
between itself and North Korea
to stave off wild boars carrying 
African swine fever. 

Cases of the highly contagious 
and lethal virus in pigs started 
to appear in South Korea last 
month. Authorities there have 
confirmed 15 cases in wild boars 
and 14 in domestic pigs on farms 
near the border with North 
Korea. There is no vaccine or 
treatment for the virus. 

The nation has culled more 
than 150,000 pigs so far to 
contain the spread, says the 
agriculture ministry. The 
defence ministry says it has 
mobilized civilian teams of 
hunters, as well as military 
snipers, to take down wild boars 
near the North Korean border, 
according to South Korean 
media. 

The Food and Agriculture 
Organization of the United 
Nations says that at least ten 
countries in Asia have ongoing 
outbreaks of the virus, which has 
wiped out millions of pigs. 

The disease’s spread comes 
as scientists in China reported 
on 17 October that they have
obtained the most detailed 
picture yet of the virus’s 
structure, which could assist 
vaccine development (N. Wang 
et al. Science http://doi.org/ 
ddbp; 2019). 
News in focus
Archived weather data are helping to produce more-accurate climate predictions, such as the extent of rainfall in Mali.


SCIENTISTS STRUGGLE 
TO ACCESS AFRICA'S 
HISTORICAL CLIMATE DATA 


Better climate predictions require Africa’s weather agencies to open up their 
archives. But commercial concerns and a lack of trust are holding them back. 


By Linda Nordling 


For principal meteorologist Grieffy John
Stegling, the storerooms at Botswana’s 
national weather-service headquarters 
in Gaborone hold a rare treasure: floor- 
to-ceiling shelves containing boxes of 
old notebooks with carefully recorded weather 
observations going back more than a century.
Such records offer clues not only to the 
country’s past, but also to the future of its cli- 
mate. Like most African countries, Botswana 
is ill served by global climate models, because
predictions are based on patchy records of 
key variables such as temperature, humidity 
and atmospheric pressure (E. Archer et al. 
Biodivers. Ecol. 6, 14-21; 2018). 


“Historical climate data over Africa are 
very valuable for understanding climate 
variability and trends,” says Chris Taylor, a 
meteorologist at the Centre for Ecology and 
Hydrology in Wallingford, UK, who studies 
African climate trends. 

In 2017, Taylor and his team found that 
climate change will increase extreme rain- 
fall in the Sahel, a semi-arid region south of 
the Sahara Desert (C. M. Taylor et al. Nature 
544, 475-478; 2017). A crucial part of their 
study involved cobbling together histori- 
cal records — some of them “locked away in 
cupboards” — from different national weather 
services, Taylor says. “Having a historical 
baseline is a prerequisite for understanding 
how intense rainfall is changing,” he says. 
Since 2015, the World Meteorological
Organization in Geneva, Switzerland, and 
Germany’s weather service, Wetterdienst, 
have provided training and equipment to 
help Botswana digitize and share its historical 
climate data. But because there are no dedi- 
cated staff members, progress has been slow. 
Of 2 million records, only 100,000 have been 
processed. “If we had more manpower, it would 
go much faster,” Stegling says. 

Whereas Botswana is making some progress, 
in other meteorological offices across 
Africa, millions of records are mouldering in 
cardboard boxes or languishing on obsolete 
technology. Digitization efforts have been held 
up because of concerns that giving researchers 
free access to the data will prevent such offices 
from making money by selling the information.

The South African Weather Service (SAWS) 
has turned down offers by the International 
Data Rescue (I-DARE) project to help digitize 
historical climate data because the agency 
wants to be able to sell its data. “If unrestricted
access to the National Climatological Databank, 
of which SAWS is the custodian, is allowed, 
SAWS might not be able to deliver on its commercial
mandate,” a spokesperson told Nature.

Similar concerns are holding up the digi- 
tization of 2 million surface observations — 
including temperature, rainfall and humidity 
— from 48 African countries. Those data are 
stored at the African Centre of Meteorologi- 
cal Applications for Development (ACMAD) 
in Niamey, Niger. 

“The private sector is progressively being 
involved in climate-services delivery,” says 
ACMAD director-general Andre Kamga 
Foamouhoue, and this sometimes creates 
conflicts of interest with government agencies
looking to commercialize data. 

Many of the data-rescue requests come from 
initiatives led by individuals or institutions 
from Europe or the United States, says Jane 
Olwoch, executive director of the Southern 
African Science Service Centre for Climate and 
Land Management, a regional climate research
centre in Windhoek, Namibia. 

And that can be a problem because 
institutions in African countries aren’t sure 
how they will benefit if the data expertise comes 
from outside the continent. She hopes that 
data-rescue efforts fronted by her own organization,
in Angola and Botswana, will be viewed
with less suspicion because the organization is 
backed by four southern African governments 
and has local headquarters and staff, even 
though much of the funding comes from the 
German government. 


Recovering old records 


Not all of Africa’s climate records are in Africa,
however. Many of the oldest ones were collected 
by professional and amateur meteorologists 
who came to Africa from Europe during 
colonial times. Stefan Grab, a geographer 
at the University of the Witwatersrand in 
Johannesburg, South Africa, says that, para- 
doxically, these records can be easier to access 
than local ones. 

South Africa has the Southern Hemisphere’s 
longest uninterrupted weather observations, 
recorded at the astronomical observatory 
in Cape Town. It was thought that these data 
stretched back to 1841, but Grab, who leads 
South Africa’s data-rescue efforts, knew 
that astronomers had been in the Cape since 
the 1830s. So he contacted staff at the Royal 
Greenwich Observatory in London, who 
directed him to the archives at the University of
Cambridge, UK. “Lo and behold, they found the 
earliest records, which go back to 1834,” he says. 

ACMAD’s Kamga Foamouhoue says that 
weather agencies must be persuaded of the
benefits of mining, and then sharing, historical 
data with other scientists, and that the biggest 
benefit is more-accurate climate predictions. 


“Anything that’s really old, like from the 
nineteenth century, is extremely valuable,” 
Grab emphasizes. “It’s worth far more than 
gold and diamonds.” 


CANADIAN RESEARCHERS 
RELIEVED AS TRUDEAU 
EKES OUT ELECTION WIN 


Government's need to rely on progressive parties 
bodes well for climate policies and science funding. 


By Brian Owens 


Canada's Prime Minister Justin Trudeau 
won a second term on 21 October, 
but his Liberal Party lost its majority 
in parliament. The Liberals and the 
Conservative Party were locked in a 
dead heat leading up to election night, and the 
outcome of the closely fought race brought a 
sense of relief for many researchers in Canada. 

Apart from climate change, science issues 
didn’t feature prominently in the election campaign. 
But researchers are hopeful that several 
of the parties that won seats in parliament will 
work together to pass climate-change policies 
and increase science funding. 

The left-leaning Liberal Party — which has 
had a mixed, but generally positive track record 
with respect to science — won 157 seats in par- 
liament, 13 shy of a majority. The Conservatives 
won the second-largest number of seats at 121. 
This means that the Liberals will need to rely 
on other minority parties in parliament, spe- 
cifically the New Democratic Party (NDP) and 
the Green Party, to help pass legislation. Such 
policies could include the Liberal proposal of 
reaching net-zero carbon emissions by 2050. 

Justin Trudeau leads Canada’s Liberal Party. 

Cathleen Crudden, a chemist at Queen’s 
University in Kingston, Canada, says that 
there was a real possibility that the Conserva- 
tives would win, and that they would undo the 
progress in science made over the past four 
years under the Liberal government. “What 
the Liberals started under their last mandate 
now has an opportunity to continue,” she says. 

The government’s reliance on minority 
parties such as the NDP could be especially 
helpful to scientists when it comes to issues 
such as funding, says Molly Sung, a chemist 
at the University of Toronto in Canada who 
supported the NDP in the election. During the 
campaign, the NDP committed to boosting 
funding for basic research by Can$80 million 
(US$61 million) per year, she says. Sung thinks 
that their influence in a minority government 
could result in more money for science. 

The NDP, along with the Liberals and 
the Greens, also supports the existing 
science-integrity policy that ensures govern- 
ment researchers can speak freely about their 
work. And all three parties have expressed 
interest in keeping the science-adviser position 
created by Trudeau in 2017. 

Despite the agreement between the three 
progressive parties on science and climate 
issues, analysts also expect some friction. The 
NDP and the Greens say that they are open to 
cooperating with the Liberals. But they will 
probably demand concessions for their support 
— such as more-ambitious climate targets 
or increases in the country’s carbon tax — that 
the Liberals might not be comfortable with. 

It’s important that researchers keep working 
to focus the government's attention on science 
issues, says Katie Gibbs, executive director of 
the campaign group Evidence for Democracy 
in Ottawa. Minority governments tend to be 
messy as different interests jockey for power, 
so strong advocacy for science will be especially 
crucial, Gibbs says. “The science community 
is going to have to work hard and be loud to 
ensure these concerns don't fall off the agenda.” 
Historians want to build a virtual archive for Venice, pictured here in the eighteenth century. 



VENICE ‘TIME MACHINE’ 
PROJECT SUSPENDED 
AMID DATA ROW 


Disagreements between international partners leave 
plans to digitize the Italian city’s history in limbo. 


By Davide Castelvecchi 


Like the city itself, an ambitious effort 
to digitize ten centuries’ worth of 
documents that record the history of 
Venice is at risk of sinking. Two key partners 
have suspended the Venice Time 
Machine project after reaching an impasse 
over issues surrounding open data and meth- 
odology. The State Archive of Venice and the 
Swiss Federal Institute of Technology in Lau- 
sanne (EPFL) say they have had to pause data 
collection, and the archive’s director has raised 
questions about the usability of the informa- 
tion that has already been collected. 

The project sought to digitize documents 
that stretch over 80 kilometres of shelves in 
the state archive. These record the minutiae 
of the city’s administration — from financial 
transactions to citizens’ addresses and family 
connections — during its heyday in the Middle 
Ages and the Renaissance as a republic that 
dominated trade in the eastern Mediterranean 
for centuries. Many are written in Latin or the 
Venetian dialect, and have never been read by 
modern historians. 


The goal was to make this information freely 
available online to researchers worldwide. The 
project also aimed to advance the state of the art 
in text-recognition technology for handwrit- 
ten documents, using machine learning to 
automatically read millions of pages and tag 
their contents so that historians could perform 
quick searches. 

The project was launched as a collaboration 
between EPFL, the State Archive of Venice and 
the Ca’ Foscari University of Venice. In 2014, 
all three organizations signed a non-binding 
memorandum of understanding on how the 
work would be conducted. 

However, the original agreement left out 
crucial details on the research protocols, 
according to a 19 September press release 
from the archive announcing the suspension. 
In particular, it didn’t specify the type of licens- 
ing that would regulate researchers’ use of the 
digitized data — which must also comply with 
Italian law, says the archive’s current director, 
Gianni Penzo Doria. He adds that he tried to 
jump-start negotiations for a detailed contract 
after he took up his post in August, but the two 
sides came to an impasse. The decision to halt 
the project was inevitable, he says, and mutual. 

But on 23 September, EPFL issued its own 
sharply worded press release saying that the 
archive had suspended the project unilater- 
ally, and that EPFL was surprised to learn of the 
decision from the archive’s website. “I think it’s 
essentially a misunderstanding,” says Frédéric 
Kaplan, a computer scientist at EPFL and direc- 
tor of the Venice Time Machine. He adds that 
the disagreement could potentially have been 
resolved by face-to-face meetings between the 
collaborators, but that so far all discussion had 
been by teleconference. 


‘Useless’ files 


Meanwhile, the fate of the 8 terabytes of data 
accumulated over the past 5 years — from 
around 190,000 documents — is unclear. 
Penzo Doria claims that from the point of 
view of archival science, “these files are use- 
less”, because their digitization did not follow 
guidelines set by the International Research on 
Permanent Authentic Records in Electronic 
Systems (InterPARES) project. 

These guidelines mandate the scrupulous 
recording of information that certifies the 
provenance of each document, and require 
that a record of such information be kept in the 
metadata that comes with each file. This serves 
as a sort of electronic signature that ensures 
the long-term preservation and validation of a 
digital file. According to Penzo Doria, the EPFL 
researchers who made the scans did not doc- 
ument how they collected such information 
— or, if they did, they didn’t share such docu- 
mentation with collaborators at the archive. 

Kaplan says the researchers did collect 
metadata, but based their methodology on 
the International Standard Archival Description 
guidelines from the International Council 
on Archives. He says that EPFL researchers 
followed procedures established by the state 
archive’s own staff. Kaplan also says that he 
provided documentation on the metadata in 
an e-mail to Penzo Doria’s predecessor, Gio- 
vanna Giubbini, in February 2019. Penzo Doria 
and Giubbini both told Nature that they never 
received this documentation. 

Raffaele Santoro, who was director of the 
State Archive of Venice in 2014 when the 
memorandum on the Time Machine project 
was signed, says that he doesn’t know pre- 
cisely how workers collected the metadata, 
but he assumes that they are scientifically 
valid because the archive’s own staff were 
closely involved in the process. To make the 
documents that have already been digitized 
compliant with additional standards, one 
could simply add more information to the 
metadata, he says, “without any need to do 
it all over again”. 

Kaplan says he is hopeful the project can get 
back on track if the two sides meet to discuss 
new terms in person. “EPFL sincerely hopes the 
meeting will happen soon,” he says. 
News in focus 


Black people were less likely than white people to be sent for personalized care, a study found. 


MILLIONS AFFECTED 
BY RACIAL BIAS IN 
HEALTH-CARE ALGORITHM 


Study reveals widespread racism in decision- 
making software used by US hospitals. 


By Heidi Ledford 


An algorithm widely used in US hospitals 
to allocate health care to patients 
has been systematically discriminating 
against black people, a sweeping 
analysis has found. 

The study, published in Science on 24 Octo- 
ber, concluded that the algorithm was less 
likely to refer black people than white people 
who were equally sick to programmes that 
aim to improve care for patients with complex 
medical needs (Z. Obermeyer et al. Science 
366, 447-453; 2019). Hospitals and insurers 
use the algorithm and others like it to help to 
manage care for about 200 million people in 
the United States each year. 

This type of study is rare, because research- 
ers often cannot gain access to proprietary 
algorithms and the reams of sensitive health 
data needed to fully test them, says Milena 
Gianfrancesco, an epidemiologist at the Uni- 
versity of California, San Francisco, who has 
studied sources of bias in electronic medical 
records. But smaller studies and anecdotal 
reports have documented unfair and biased 
decision-making by algorithms used in 
everything from criminal justice to education 
and health care. 

“It is alarming,” says Gianfrancesco of 
the latest study. “At the same time, it’s not 
surprising.” 

Ziad Obermeyer, who studies machine 
learning and health-care management at the 
University of California, Berkeley, and his team 
stumbled across the problem while examin- 
ing the impact of programmes that provide 
additional resources and closer medical super- 
vision for people with multiple, sometimes 
overlapping, health problems. 

When Obermeyer and his colleagues ran rou- 
tine statistical checks on data they received 
from a large hospital, they were surprised to 
find that people who self-identified as black 
were generally assigned lower risk scores than 
equally sick white people. As a result, the black 
people were less likely to be referred to the pro- 
grammes that provide more-personalized care. 

The researchers found that the algorithm 
assigned risk scores to patients on the basis 
of total health-care costs accrued in one year. 
They say that this assumption might have 
seemed reasonable because higher health- 
care costs are generally associated with greater 
health needs. The average black person in the 
data set that the scientists used had similar 
health-care costs to the average white person. 

But a closer look at the data revealed that the 
average black person was also substantially 
sicker than the average white person, with 
a greater prevalence of conditions such as 
diabetes, anaemia, kidney failure and high 
blood pressure. Taken together, the data 
showed that the care provided to black people 
cost an average of US$1,800 less per year than 
the care given to white people with the same 
number of chronic health problems. 

The scientists speculate that this reduced 
access to care is due to the effects of systemic 
racism, ranging from distrust of the health- 
care system to direct racial discrimination by 
health-care providers. 

And because the algorithm assigned people 
to high-risk categories on the basis of costs, 
those biases were passed on in its results: black 
people had to be sicker than white people 
before being referred for additional help. Only 
17.7% of patients that the algorithm assigned to 
receive extra care were black. The researchers 
calculate that the proportion would have been 
46.5% if the algorithm were unbiased. 
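
The mechanism described above can be illustrated with a toy calculation. The sketch below is not Optum's algorithm and does not use the study's data: the patients, the base cost, the cost per condition and the function names are all invented for illustration, and only the US$1,800 gap comes from the reporting. It ranks the same patients once by cost and once by number of chronic conditions, then compares who would be referred.

```python
# Toy illustration of label bias: ranking patients by cost instead of illness.
# All patients and the two cost parameters below are hypothetical.

BASE_COST = 2000            # assumed fixed yearly cost per patient (made up)
COST_PER_CONDITION = 1500   # assumed cost per chronic condition (made up)
COST_GAP = 1800             # gap reported in the article, in US dollars


def make_patients():
    """Build two groups with identical illness profiles (1 to 5 conditions)."""
    patients = []
    for group in ("white", "black"):
        for conditions in range(1, 6):
            for _ in range(10):
                cost = BASE_COST + COST_PER_CONDITION * conditions
                if group == "black":
                    cost -= COST_GAP  # same illness, less money spent on care
                patients.append(
                    {"group": group, "conditions": conditions, "cost": cost}
                )
    return patients


def share_black_referred(patients, key, n_referred):
    """Refer the n_referred highest-scoring patients; return the black share."""
    ranked = sorted(patients, key=lambda p: p[key], reverse=True)
    referred = ranked[:n_referred]
    return sum(p["group"] == "black" for p in referred) / n_referred


patients = make_patients()
print(share_black_referred(patients, "cost", 20))        # 0.0
print(share_black_referred(patients, "conditions", 20))  # 0.5
```

In this made-up population both groups are equally sick, yet a cost-based ranking refers no black patients among the top 20, whereas a ranking by number of conditions refers equal numbers from each group. The real gap reported in the study (17.7% versus 46.5%) is far less extreme than this deliberately simple example.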

When Obermeyer and his team reported 
their findings to the algorithm’s develop- 
ers — Optum of Eden Prairie, Minnesota — the 
company repeated their analysis and got the 
same results. Obermeyer is working with the 
firm without salary to improve the algorithm. 

He and his team collaborated with the 
company to find variables other than health- 
care costs that could be used to calculate a 
person’s medical needs, and repeated their 
analysis after tweaking the algorithm accord- 
ingly. They found that making these changes 
reduced bias by 84%. 

“We appreciate the researchers’ work,” 
Optum said in a statement. But the company 
added that it considered the study’s conclu- 
sion to be “misleading”. “The cost model is 
just one of many data elements intended to be 
used to select patients for clinical engagement 
programs.” 

Obermeyer says that using cost prediction 
to make decisions about patient engagement 
is a pervasive issue. “This is not a problem with 
one algorithm, or one company — it’s a prob- 
lem with how our entire system approaches 
this problem,” he says. 


Examining assumptions 


Correcting bias in algorithms is not straight- 
forward, Obermeyer adds. “Those solutions 
are easy in a software-engineering sense: you 
just rerun the algorithm with another variable,” 
he says. “But the hard part is: what is that other 
variable? How do you work around the bias 
and injustice that is inherent in that society?” 

This is in part because of a lack of diversity 
among algorithm designers, and a lack of training 
about the social and historical context of 
their work, says Ruha Benjamin, author of Race 
After Technology (2019) and a sociologist at 
Princeton University in New Jersey. 

“We can’t rely on the people who currently 
design these systems to fully anticipate or 
mitigate all the harms associated with 
automation,” she says. 

Developers should routinely run tests such 
as those performed by Obermeyer’s group 
before they deploy an algorithm that affects 
human lives, says Rayid Ghani, a computer 
scientist at Carnegie Mellon University in 
Pittsburgh, Pennsylvania. That kind of auditing 
is more common now, he says, since reports 
of biased algorithms have increased. 

He thinks that the results of these audits 
should always be compared to human 
decision-making. Unpublished analyses by 
Ghani’s team have compared algorithms used 
in public health, criminal justice and education 
to human decision-making, and found that the 
machine-learning systems were biased — but 
less so than the people. 

“We are still using these algorithms called 
humans that are really biased,” says Ghani. 
“We've tested them and known that they’re 
horrible, but we still use them to make really 
important decisions every day.” 


‘GAY GENE’ APP 
PROVOKES FEARS OF A 
GENETIC WILD WEST 


Debate highlights broader concerns about apps that 
use the results of direct-to-consumer genetic testing. 


By Amy Maxmen 


Joseph Vitti’s stomach turned when he 
opened a link an acquaintance had sent 
him. It took him to an app called ‘How 
Gay Are You?’ that purported to gauge 
a person’s level of attraction to others of 
the same sex, according to their genes. 

The app’s creator, Joel Bellenson, a US entre- 
preneur living in Kampala, Uganda, based the 
test on the findings of a massive study on the 
genetics of same-sex sexual behaviour — even 
though the analysis, published in Science in 
August, concluded that a person’s genes can- 
not predict their sexuality (A. Ganna et al. 
Science 365, eaat7693; 2019). 

Vitti, a computational geneticist at the 
Broad Institute in Cambridge, Massachusetts, 
thinks the app was misleading — even danger- 
ous. “There are vulnerable queer people all 
over the world,” says Vitti, “and this app stands 
to hurt them.” On 11 October, he started an 
online petition to remove the test. Within two 
weeks, more than 1,660 people had signed it. 

Bellenson says that the idea his test could 
endanger people is an “absurd scenario” and 
notes that the test also included a warning that 
it could not predict same-sex attraction. 

But the furore over his app highlights a 
growing problem in the field of genetics. 
Researchers conduct statistically sophisti- 
cated analyses of hundreds of thousands of 
genomes, searching for associations between 
genetic variations and diseases, behaviours or 
other characteristics. Anyone can take the var- 
iations identified by such studies, strip them 
of caveats and nuance, and market a simple 
genetic-interpretation tool online. 

Scientists and genetic counsellors say that 
these unregulated tools can harm individuals 
and society, causing anxiety, unnecessary 
medical expenses, stigmatization and 
worse. “It’s the Wild West of genetics,” says 
Erin Demo, a genetic counsellor at Sibley Heart 
Center Cardiology in Atlanta, Georgia. “This is 
just going to get harder and harder.” 

Bellenson posted his app on GenePlaza, an 
online marketplace for DNA-interpretation 
tools, in early October. For US$5.50, a person 
could upload their genetic data — as supplied 
by consumer DNA sequencing companies 
such as 23andMe of Mountain View, Califor- 
nia — and the app would place them along a 
gradient of same-sex attraction. The app cited 
the Science study and warned users that it did 
not predict same-sex attraction. 

The researchers behind the Science study 
say that Bellenson’s app misrepresents their 
work. The test “is not grounded in science. It 
is not predictive. It won’t tell you anything”, 
says Benjamin Neale, a geneticist at the Broad 
Institute and an author of the Science analysis. 
He and his colleagues examined the DNA of 
around 475,000 people and found 5 genetic 
variations loosely correlated with people who 
said they’d had sex with someone of the same 
sex at least once. But none of the variants was 
so prevalent that the researchers could use 
them to predict a person’s sexual identity. 

Neale sent a letter to GenePlaza on 
14 October asking that it take down the app 
— or remove references to his study. The next 
week, Bellenson renamed the app ‘122 Shades 
of Gray’ and added a note explaining that the 
authors of the Science study weren't affiliated 
with the project. He says that because the app 
has always warned users that it is not predic- 
tive, it does not misrepresent the study. 

But the chorus of angry scientists on Twitter 
grew louder. Some echoed Vitti’s concern that 
the app could be abused. In his petition, Vitti 
noted that Bellenson lives in Uganda, where 
gay sex is punishable by life in prison. Vitti 
worried that, regardless of the app’s scientific 
flaws, Ugandan authorities could get hold of a 
person’s results and use them as evidence of 
sexual preferences. 

Bellenson says that there are much simpler 
ways of discovering a person’s sexual prefer- 
ence, such as looking at their social-media 
accounts. “The idea that a government would 
need a DNA test to figure out if someone is gay 
is ridiculous,” he says. 
Millions of people have had their DNA sequenced by consumer genetic-testing companies. 


On 24 October, GenePlaza co-founder Alain 
Coletta removed the app from his platform. He 
and Bellenson both say they did not intend to 
hurt anyone by making the app available. And 
they echo other creators of third-party tools 
that interpret DNA sequencing data, who say 
that even if their tests aren’t predictive, they 
encourage public engagement in science. “It 
may not be much better than a horoscope or 
a tarot-card reading, but at least it lets bioinformatics 
be something fun,” Bellenson says. 

This argument concerns genetic counsel- 
lors, who have seen a surge in the number of 
people seeking help for conditions that third- 
party tools have identified in their DNA — often 
inaccurately. 

Tens of millions of people worldwide have 
now had their DNA sequenced by direct-to- 
consumer companies. But these sequencing 
companies only highlight certain genetic asso- 
ciations. If customers want more information, 
they can download their raw genetic data from 
these firms’ sites for further exploration. 

Up to 62% of customers ultimately upload 
their genetic data to third-party websites for 
a small fee, a study published in August found 
(T. Moscarello et al. Genet. Med. 21, 539-541; 
2019). GenePlaza, for example, offers DNA-in- 
terpretation apps that purport to assess intel- 
ligence, neuroticism and taste perception. 
Other websites advertise services that use 
DNA to explore a person’s ancestry, disease 
risk, ideal romantic partner, fondness for mari- 
juana, nutritional needs, sleep habits and more. 

In 2015, 23andMe learnt that its custom- 
ers could feed their DNA data directly from 
23andMe’s servers into a secondary application 
associated with white supremacists 
that evaluated a person’s degree of European 
ancestry. 23andMe shut down the app’s access 
to data on its servers. The company went further 
last year by restricting direct access to its 
data to select collaborators. 

23andMe also warns customers who download 
their data that it cannot ensure the accuracy of 
third-party interpretation tools. Developers of 
these tests might base them on genome-wide 
analyses that find weak correlations, or asso- 
ciations that have been contradicted by addi- 
tional analyses. 

But Vitti thinks scientists should bear more 
responsibility for how their results are used — 
especially now that geneticists are delving 
deeper into social and behavioural traits, 
such as links between a person’s DNA and their 
political persuasion. 

He argues that ethical review boards should 
assess whether the benefits of such studies 
outweigh the potential for harm. Genome- 
wide analyses are not scrutinized to the same 
degree as research on individuals because the 
data they rely on are pooled and anonymized. 
But the How Gay Are You? app illustrates how 
such analyses could lead to harmful outcomes, 
Vitti says. 

Despite his distaste for the app based on 
his study, Neale says research must go on. 
“Scientists have a responsibility to describe 
the human condition in a more nuanced and 
deeper way,” he says. 

But Sarah Nelson, a geneticist at the Univer- 
sity of Washington, Seattle, who has studied 
third-party interpretation tools, worries that 
her peers aren’t fully aware how difficult their 
studies can be for the public to understand. 
Even if researchers take pains to explain that 
their genome-wide analyses aren't predictive, 
she says, companies can still use the science as 
they please — and the barrier to entry is low. 

Indeed, Bellenson says he whipped together 
his app in a weekend. He knew enough about 
genetics and computer programming to write 
an algorithm, and find a home for it online. 
“Genetics and bioinformatics is so mature,” 
he says. “Academia can no longer control it.” 
A tiger and her cubs leave India’s Bandhavgarh National Park in search of prey, which is scarce inside the reserve. 


TIGER TROUBLE 


612 | Nature | Vol 574 | 31 October 2019 


© 2019 Springer Nature Limited. All rights reserved. 


India is trying to save its tiger 
population, but researchers question 
the country’s long-term plans to 
protect this endangered species. 

By Gayathri Vaidyanathan 
Central India — The Maruti Gypsy 4x4 
sped along a jungle track, jolting us 
out of our seats. We had signed up 
for a wolf safari, but the trip leader 
had another quarry in mind. The 
vehicle barrelled towards a pungent 
smell on a hillside — a fresh tiger kill. 

The forest guide spoke to one of 
his colleagues in a different vehicle and then 
barked at our driver to rush towards a nearby 
meadow. A tigress and four cubs are at a 
watering hole just beyond our sights, he said. 

A full Moon rose, and revealed an ink-blue 
landscape. Handheld lights were banned, so 
visibility was at 3 metres. The phone rang, and 
the guide instructed the driver, who raced on 
a rollercoaster route back to the kill site. No 
tiger. We dashed back to the meadow, a second 
vehicle in hot pursuit. It felt ugly, like a hunt. 

Two circuits later, the Moon was high over 
the meadow when we were beckoned once 
more back to the kill site. We raced there to find 
four Gypsys, the drivers using their headlights 
to sweep the hillside. Another vehicle banged 
into ours. Our guide cursed. Then silence, as 
the drivers shut off the engines. Tourists stood 
on seats, peering through telephoto lenses. 

Footsteps rustled dead leaves, and the drivers 
switched on the high beams. There sat two 
tigers, larger than life as wild tigers are. These 
were no cubs; they were male adolescents. Cam- 
era shutters clicked. Minutes later, the animals 
got up and disappeared into the darkness. 

Two hundred years ago, tens of thousands 
of tigers (Panthera tigris) roamed India and 
29 other nations, from the Indonesian swamps 
to the Russian taiga. There were once Balinese, 
Caspian and Javanese subspecies, all now con- 
sidered extinct. Today, only six subspecies 
remain. The International Union for Conser- 
vation of Nature (IUCN) estimated in 2014 that 
there are only about 2,200 to 3,200 individuals 
in the wild, placing the animal on the organization’s 
endangered list. About 93% of the tiger’s 
historic range has emptied owing to habitat 
loss, poaching and depletion of prey. 

The spectre of a world without tigers led 
13 nations to meet in 2010 in St Petersburg, 
Russia, where they declared that they would 
double their wild tiger numbers by 2022. But all 
except India, Nepal and Bhutan are struggling 
to save their tigers, even in protected reserves. 

Against this backdrop, India is the beacon. 
It has roughly two-thirds of the world’s tigers 
in less than one-quarter of their global range. 
In 2019, it has invested 3.5 billion rupees 
(US$49.4 million) in tiger conservation, includ- 
ing relocating villages outside protected areas. 
And it has built the world’s largest animal underpass 
to funnel tigers safely beneath a highway. 

About 3% of the spending on tigers is flowing 
to government-sponsored science. Govern- 
ment scientists are studying all aspects of the 
animal, and are heading a large tracking study 
to understand tiger behaviour. 
The efforts have paid off, according to the 
government. It announced in July that the number 
of wild tigers in the country had doubled 
from 1,411 in 2006 to 2,967 today — meaning 
that India has met the St Petersburg target. 
Indian Prime Minister Narendra Modi declared 
that tiger conservation could go hand in hand 
with building roads, railways and homes. 

But parse the country’s tiger data, and the 
story becomes murky. The animals are increas- 
ingly becoming isolated in small reserves that 
prioritize tourism. If the cats leave the parks, 
the risks are rising that they will encounter 
humans and infrastructure, with tragic results 
for both the animals and people. Some scien- 
tists question whether tiger numbers in India 
have truly increased and are attempting to get a 
more accurate count of populations in specific 
areas. Other researchers are studying how to 
get people and the carnivores to coexist. 

Saving tigers is difficult enough, but research 
efforts in India are made more challenging 
by an apparent antagonism between the 
actors involved. Some experts charge that 
government scientists sometimes present 
questionable evidence in support of state 
policies and hamper efforts by independent 
investigators. Such conflicts are routine in tiger 
conservation globally, says John Goodrich, 
who heads the tiger programme at Panthera, 
a conservation organization in New York City. 

“It’s something that I’ve been incredibly 
frustrated with,” he says. “We all have all this 
data, all this knowledge that we need to be 
sharing.” 


The national animal 

Two hundred years ago, an estimated 
58,000 tigers roamed India’s lush, unbro- 
ken forests’. But centuries of hunting and 


habitat destruction left fewer than 2,000 wild 
individuals by the 1970s. In 1973, the gov- 
ernment declared the tiger India’s national 
animal, banned hunting and set up a conser- 
vation scheme called Project Tiger. There are 
50 reserves today under the programme, and 
about half are well managed, according to a 
government assessment. But the reserves 
are small, averaging less than 1,500 square 
kilometres — much smaller than many 
protected areas in Africa. 

These are unfavourable conditions for the 
solitary tiger. Male Bengal tigers need a home 
range of about 60–150 km², whereas females 
use about 20–60 km². And tigers do not share 
easily, even with siblings or kids. So when a cub 
hits adolescence at about one and a half years, 
it begins roaming to find territory in which to 
live and hunt. If the tiger reserve is already 
full, it has two options: either push out an old 
or weak tiger and take over the space, or keep 
moving well outside the reserve until it finds 
unoccupied territory. It is thought that 70-85% 
of India’s tigers are inside reserves. 

These numbers are from India’s tiger census. 
Every four years, an army of forest guards, con- 
servationists and volunteers fans out over an 
area roughly the size of Japan and carries out 
a comprehensive census. It’s a difficult task 
because tigers are elusive. The workers place 
camera traps in some parts of tiger reserves 
for about 35 days. Then they walk on foot, col- 
lecting sightings of tiger tracks, scat and signs 
of prey and human disturbance. This is called 
a sign survey. They send the data to scientists 
at the government-run Wildlife Institute of 
India (WII) in Dehradun, who identify individual 
tigers in photos from their unique stripe pat- 
terns and then estimate local tiger densities in 
reserves. They create a calibration model that 
links the tiger densities to the collected signs, 
then input the sign-survey data into this model 
to derive nationwide numbers. 
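
The two-step procedure described above can be sketched in a few lines. The article does not describe the form of the WII's calibration model, so the straight-line fit, the site data and every name below are assumptions made purely for illustration. Step 1 fits a line relating sign frequency to camera-trap density at sites that have both; step 2 applies that line to sites where only signs were collected.

```python
# Hypothetical sketch of calibrating sign surveys against camera-trap densities.
# The linear model and all numbers are illustrative, not the WII's method or data.


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def estimate_tigers(sites, slope, intercept):
    """Predict density at each site, scale by its area, and sum the totals."""
    total = 0.0
    for signs_per_km, area_km2 in sites:
        # Predicted tigers per 100 km^2, never allowed to go below zero.
        density = max(0.0, slope * signs_per_km + intercept)
        total += density * area_km2 / 100.0
    return total


# Step 1: sites with both camera traps and sign surveys (made-up values).
calib_signs = [0.5, 1.0, 2.0, 3.0]      # tiger signs per km walked
calib_density = [1.0, 2.0, 4.0, 6.0]    # tigers per 100 km^2 from photos
slope, intercept = fit_line(calib_signs, calib_density)

# Step 2: sites with sign surveys only, as (signs per km, area in km^2).
sign_only_sites = [(1.5, 400.0), (0.5, 1000.0)]
print(estimate_tigers(sign_only_sites, slope, intercept))
```

Any error in the sign counts, or in the assumed relationship between signs and density, carries straight through to the final total. That is why the quality of the field data and the transparency of the model matter so much to the critics quoted in this article.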

“Unless you know what you have and 
where you have it, you can’t manage it,” says 
Yadvendradev Jhala, who heads the tiger team 
at the WII and is responsible for the survey. 

The latest census suggests that tigers are 
rebounding, and Modi celebrated a 33% 
increase in numbers since 2014. 

But many scientists are sceptical. Ullas 
Karanth, director of the Centre for Wildlife 
Studies in Bengaluru, questions the sign sur- 
veys, which he says are collected by ill-trained 
workers who don’t know how to do accurate 
counts — an accusation he based on his own 
experiences with the field workers. “The field 
protocols are deeply flawed,” Karanth says. 
When I walked with forest guards doing surveys 
in a reserve in May, they said they felt pressured
by local officials to record positive tiger signs 
and ignore signs of human disturbance. 

Critics also argue that Jhala’s team varies 
the census coverage every time. In 2018, they 
added 90 survey sites and 17,000 extra cameras.
These types of differences make it difficult
to compare census years and to say how India’s 
tigers are faring, says Abishek Harihar, a pop- 
ulation ecologist with Panthera in Bengaluru. 

Another point of contention is the data anal- 
ysis, particularly the calibration model used to 
arrive at pan-India numbers. The description of
the methodology and models used is “vague”, 
and the resulting numbers have “higher uncertainties
than are currently reported”, says Arjun
Gopalaswamy, a statistical ecologist and sci- 
ence adviser at the Wildlife Conservation 
Society in New York City. He has authored two 
studies critiquing the census method.

Jhala refutes the criticisms about the accu- 
racy of the census. He says there are safeguards 
to protect against bad data. Although the cov- 
erage has increased, he says the census is based 
on estimates of tiger density, so increasing the
extent of the survey does not affect the trend 
calculations. He has published a study refuting 
the accusations.

The best way to resolve the disagreement, 
argue scientists, would be if the WII released 
raw data and model information to ecologists 
for independent analysis. But Jhala says that 
releasing the geo-tagged data, even to scien- 
tists, could make the animals vulnerable to 
poaching, a claim that others dispute.

The result is that there is little consensus on 
India’s tiger population and, more importantly,
whether it is rebounding or has remained 
steady for many years. For now, scientists can 
say only that the animals are thriving in some
places, but doing poorly elsewhere. 

The biggest known conservation success is 
in central India, an area with 19 tiger reserves 
across 8 states. I travelled there in May with 
researchers from the Mumbai-based Wildlife 
Conservation Trust to see how India’s best-kept
tigers are faring.

The central Indian forests in Maharashtra 
state were brown and crackly under the 45 °C 
heat. Most trees had dropped their leaves for 
the dry season, reservoirs had dipped low and 
everyone was waiting for the monsoons. 

The government says there are 1,033 tigers 
in central India, up 50% since 2014 (see ‘Cat 
count’). That’s more than one-third of India’s 
tigers. The region attracts a proportionally high
number of India’s tiger scientists. 

They have found that historically, tigers here 
have moved unhindered through forest corri- 
dors in search of territory, carrying precious 
new genes into distant populations. The central
Indian tigers have high genetic variation, which 
should help them to adapt to environmental 
crises such as drought or disease.

But the forest corridors in central India are 
fragmenting rapidly. Without roaming tigers, 
none of India’s small reserve populations would 
be demographically viable in the long run, says 
Aditya Joshi, head of conservation research 
at the Wildlife Conservation Trust. Uma 
Ramakrishnan, an ecologist at the National 
Centre for Biological Sciences in Bengaluru, 
says that if infrastructure development in rural
areas continues unabated, the genetic diversity
of small populations could fall within a century.

The government might then have to shuttle 
tigers between reserves to maintain the gene
flow necessary for a population to stay healthy.
“That will be pretty much like a zoo,” she says. 

In the worst-case scenario, tigers might 
get marooned in reserves and relatives might 
start breeding. These aren’t vague fears. In 
the Ranthambore tiger reserve, a popular 
tourist attraction in northwest India, some 
62 individuals, half of them descended from 
one matriarch, live in genetic isolation in a
1,115 km² area. Villages surround the reserve,
and there are no other tiger populations nearby
to seed new genes. Ramakrishnan and her colleagues
have seen markers of inbreeding in the
genomes of Ranthambore tigers. In an unpublished
study, they have detected regions of over
a million base pairs of DNA without variation. 
In an average tiger, there are 500 variations in 
every million or so base pairs. If these stretches 
harbour deleterious alleles, the offspring could
have reduced fitness, increasing the risk of local 
extinction, she says. 


Deadly highways 

The day before the frenzied night-time chase 
in Pench tiger reserve, Milind Pariwakam, a 
road ecologist with the Wildlife Conservation 
Trust, and I drove there on a four-lane motorway
called National Highway 44, or NH44 (also
known as NH7). In a nation full of potholes, I
appreciated the smooth road connecting two 
major cities and reducing travel time. But
Pariwakam says the road comes at a high cost.

A 65-kilometre section of the NH44 cuts 
through the tiger park, separating the 
core reserve from a forest corridor. Some 
40 mammalian species, including tigers, use 
this landscape. So do 6,151 trucks, cars and 
motorcycles that race down the NH44 every 
day. And this is not the only road through 
Pench; there are 24 smaller roads and another 
highway — the NH6. 

Roads kill millions of animals globally every 
year. And over time, busy roads become bar- 
riers to movement as some species learn to 
avoid them. Tigers, which prefer to stroll on 
paths rather than skulk through undergrowth, 
are attracted to roads and exhibit little fear of 
traffic. In the Russian Far East, home of the 
Siberian tiger, scientists looked at the impact 
of roads on 15 resident individuals. The roads 
carried 250 vehicles a day, a fraction of the 
traffic through Pench. The researchers found 
that tigers living in the area died sooner and 
had fewer offspring than did animals living in 
road-free areas.

In 2008, Pariwakam and a group of 
non-governmental organizations sued the gov- 
ernment to stop the expansion of the NH44 to 
four lanes. The fight lasted eight acrimonious 
years before WII scientists and the conserva- 
tionists came to a compromise: underpasses 
that animals could use to walk safely beneath. 

“What we always say is that conservation has 
to be affordable, it has to be sustainable,” says 
Bilal Habib, a carnivore biologist who heads 
the central India tiger programme at the WII. 
“We are a developing nation.” 

Finished in 2018, the NH44 has 9 specially 
built underpasses, ranging in length from 50 
to 750 metres, designed to allow animals to 
pass beneath the roads. These are the longest 
animal underpasses in the world, and the first 
in India. If evidence suggests they are effective,
the government might deploy them in some of
the 20,000 km of roads through wild spaces, 
Habib says. 

But although the underpasses are excellent 
on paper, Pariwakam questions their efficacy. 
Since 2018, two leopards and one tiger have 
walked across the road rather than using an 
underpass and were hit and injured. As we 
inspected one structure, a 4x4 careened into
view from a village access road and drove
through the underpass to a service ramp leading
to the highway. Pariwakam whipped out his
phone and filmed the intrusion. “The villagers 
are using the shortcut to save a quarter kilometre,”
he said, seething. He has been urging the
forest department to close off all access roads.


Mistaken identity 

This year, news about tiger deaths and fatal 
attacks on humans has popped up almost every 
week. As reserves have filled up, tigers are moving
into the forest corridors that connect them,
which are also used by people.

Tigress T49 was born in the corridor
enclosing the Chandrapur district, outside 
the Tadoba Andhari tiger reserve, not too far 
from Pench. There are 155 people per square 
kilometre here, living in 600 villages that are 
slowly encroaching into forests. There are also 
41 tigers, which is more than in half of India’s 
protected reserves. 

In December 2016, T49 had four cubs, named
E1 to E4, in a culvert under a bridge. Villagers
thronged on tractors and motorbikes to see 
the newborns. 

Habib of the WII and his graduate student
Zahidul Hussain were also interested in the 
cubs. Since 2013, Habib’s team has radio-col- 
lared adolescents to understand the behaviour 
of tigers inside and outside reserves and to 
learn about the drivers of human-tiger conflict.
They have collared 23 individuals so far, a small
sample size. But this is still the largest telem- 
etry, or tracking, study of tigers in the world. 
Their preliminary data are troubling. They 
suggest that non-reserve tigers move longer 
distances daily, perhaps to avoid humans and 
get around infrastructure. Consequently, 
they need 22% more food in an area already 
depleted by humans of wild prey. Habib says 
that of five tigers that left a reserve the team 
was monitoring, four died from walking into 
electrified wires. 

In March 2019, the scientists collared E1, E3 
and E4; E2 was shy and escaped, a trait that 
might serve her well amid humans. E1 was special.
“As soon as you take your vehicle towards
them, E1 is the first one to come to you,” Hussain
says. “She comes, sits there, curious about what
is happening.” 

The adolescents were looking for territory, 
but roads, villages and the summer’s sparse tree
cover restricted their movement. E1 favoured 
a forest fringing a village. 

On 6 April, an older woman went into the 
forest to collect flowers of the mahua tree, used
to make liquor. As she crouched down, her pos- 
ture made her look like small prey, researchers 
suspect. A tiger emerged without a sound and 
pounced. It dragged the woman 3 metres, then 
dropped her and disappeared. 

There were two more human kills in three 
weeks. Hussain’s data showed that E1 had 
been at all three kill sites, but none of the people
was eaten, suggesting they were victims
of mistaken identity; tigers generally don’t 
eat humans. Scientists and the forest depart- 
ment are racing to understand ways to mini- 
mize such human-animal encounters. Some 
are using camera traps to warn villagers when 
tigers are in their vicinity. Others are exploring 
ways to train locals in alternative livelihoods 
so that they don’t need to enter forests. Their 
efforts are urgent because the death toll is rising.
Across central India, villagers have killed
21 tigers through electrocution, traps or poi- 
soning since 2015. In Chandrapur alone, tigers 
have killed 24 people in the past 4 years.

In June, the forest department captured the
tiger E1 and moved her to a wildlife rehabilitation
centre, making her the ninth individual to
be relocated since 2015. But it might be a temporary
reprieve, as another tiger will probably
take E1’s territory. 


Scientific battles 


Much like the animals they study, tiger 
scientists are fiercely territorial. Everyone 
except Karanth at the Centre for Wildlife 
Studies requested anonymity while speaking 
about politics because it could hinder their 
ability to do research. 

Several scientists say there is a conflict of 
interest because government managers fund 
and oversee science as well as set policies 
regarding reserves. Karanth says managers
grant research permits more easily to scien- 
tists from the government-run WII than to 
independent scientists, unless the latter join 
government-led studies as junior partners. 
Independent observers also charge that government
scientists sometimes rubber-stamp
government actions, whether or not they are 
scientifically sound. 

“[The WII] seems to have completely bought 
in, they seem very biased,” one scientist says. 
An example is the NH44 road project: although
the WII initially recommended much larger 
overpasses to the government, it reworked its 
assessment to reduce costs and make it more 
palatable under pressure from government 
officials, according to a government report. 

Most independent field initiatives have shut
down, says Karanth. His 30-year-long study of 
tigers in southern India ended in 2017 because 
the local forest officials had been repeatedly 
interrupting or delaying his work, for example
by not allowing his assistants access to field
sites. The union and state government officials 
ignored his complaints. “To get the permits was 
becoming very impossible,” Karanth says. 

“Sadly, I’ve realized I don’t think I can impact
policy,” another scientist notes. 

Government officials and researchers 
challenge those criticisms. Anup Kumar 
Nayak, member secretary of the National 
Tiger Conservation Authority (NTCA), India’s 
tiger conservation and research coordinat- 
ing body, says that his agency has permitted 
several research projects by non-government 
scientists and non-profit organizations. “Most
of the research projects are given to [the WII]
because they are the technical wing of NTCA,
and they’ve been doing research work on wildlife
for a long time,” he says. “In southeast Asian
countries, they’re a very reputed organization.”

Nitin Kakodkar, who is the chief wildlife 
warden of Maharashtra and signs off on 
research permits in his state, disagrees that
WII scientists are favoured or that the managers
influence research. WII scientists, he says,
are more knowledgeable about the permit 
requirement procedures than are independ- 
ent scientists. And he contends that there is no 
favouritism in Maharashtra. “There are people 
who’ve been doing research in Maharashtra 
who are not from the Wildlife Institute of India.”

Jhala of the WII says his team finds it easier 
to get permits because they work for the gov- 
ernment, but not by much. The bureaucracy is 
difficult even for WII scientists, he says. “It’s a 
nightmare working in wildlife in this country.” 

The government maintains a tight grip 
because the tiger is a symbol of national pride,
researchers say. That exalted status, and rising
revenues from the tourism industry around
tiger safaris and luxury resorts, might be what
eventually saves the tiger from extinction. 

The Indian government has plans to expand 
tiger conservation. For example, India is going 
to increase the number of tiger reserves in 
coming years, says Nayak. 

Although numbers are stagnating in other 
countries, the ‘wild’ tiger will probably survive 
in India, at least inside reserves, researchers 
say. The animal’s fate outside reserves is more 
questionable. Older villagers don’t mind the 
large carnivores in their midst, but the younger 
generation is more circumspect. 

In Kurwahi, a village near Pench, a ‘fat’ tiger
snatched away a calf tied outside an elderly 
woman’s house in March. Hers was one of 
17 village cattle killed by the animal. I asked 
her whether she was angry. She put her palms 
together, laughed and shook her head. “How 
can I be angry with a tiger?” she said.

Her son glanced hesitantly at the 
forest-department guard, who was standing 
nearby. Then he gathered up courage and said 
what other villagers had been demanding — 
that there needs to be a more permanent solution.
“The authorities should remove the tiger.” 


Gayathri Vaidyanathan is a science journalist 
in Bengaluru.


A COMPLEX INHERITANCE


Researchers are finding links between people’s 
genes and complex attributes such as how they 
vote or the time spent in school. The worry is that 
their results will be misconstrued. By David Adam 


The deep coal mine at the Yorkshire
village of Kellingley closed in 2015 
— the last of more than 1,000 such 
pits that once drove British industry. 
As the mines closed, the jobs went 
with them. Faced with economic and 
social decline, many people who could 
moved away. 

Geneticist Abdel Abdellaoui has never been to 
Kellingley or any of the United Kingdom’s other 
former coal-mining regions. But he has found 
something surprising about the towns and their 
inhabitants. His research shows that the DNA 
in these districts is flecked with disadvantage,
just as the coal seams once threaded through 
the ground. 

By looking at the genomes of people living in 
former coal-mining areas, he has found genetic 
signatures associated with spending fewer years 
at school compared with people outside those 
areas, and — at weaker significance levels — var- 
iants that correlate with lower socio-economic 
status. Some genetic variants even correlate with 
political persuasion and whether or not commu- 
nities voted to leave the European Union in the 
2016 Brexit referendum. 

Abdellaoui, who works at the University of 
Amsterdam in the Netherlands, acknowledges 
that he is venturing onto politically charged 
ground. “I try to understand human genetic
variation and this is what I run into,” he says.

The study, published this week in Nature
Human Behaviour, is a high-profile example
of an emerging trend: using huge amounts of 
data and computing power to uncover genetic 
contributions to complex social traits. Studies 
published in the past decade have examined 
genetic variants linked to aggression, same-sex 
sexual behaviour, well-being and antisocial 
behaviours, as well as the tendency to drink 
and smoke. In doing such science, geneticists are
heading for controversial territory. They have
even been accused of “opening a new door to
eugenics”, according to the title of a 2018 MIT
Technology Review article by science historian
Nathaniel Comfort.

To the geneticists and social scientists doing 
this work, the results offer a useful and important
guide to the relative contributions of nature and
nurture to specific behavioural traits — just as 
genetic analysis can already highlight people 
who have an increased risk of cancer or heart dis- 
ease. The approach could, for example, improve 
understanding of how the environment affects 
complex traits, and so offer a way to intervene to
improve areas such as public education. 

“It is super-exciting,” says Philipp Koellinger, a
genoeconomist at Vrije University Amsterdam in
the Netherlands. “It gives us better and more-precise
ways for scientists to answer questions they
have been interested in for a long time.”

© 2019 Springer Nature Limited. All rights reserved.

Caveats abound. The genetic contribution to 
any behavioural trait is relatively small and easily 
swamped by the influence of the environment. 
The studies can reveal only whether someone is
likely to have a certain trait, and cannot predict 
the qualities of any one individual. Most scientists
are quick to point out why they do this work
(to establish what role, if any, genetics has in
behaviour) and to lay out its limitations.

But not everyone is listening: already, some 
companies see a market in reading DNA like 
a fortune-teller reads tea leaves. “That stuff 
totally gives me the shivers. But it’s happening,” 
Koellinger says. 

Critics charge that the ethical and societal 
risks of acting on such information are too great. 
“One of the main concerns is not so much the 
study of genomics, but how are we going to use 
it,” says Maya Sabatello, a bioethicist at Columbia 
University in New York City. “Who’s going to benefit?
Who’s not going to benefit? We live in a very
unequal society and this is a major challenge.” 


Strength in numbers 


For decades, geneticists assumed that most 
traits were governed by just a handful of genes,
whether it was a relatively simple one such as
height, or something as complex as antisocial 
behaviour. But as the sample sizes swelled,
researchers began to find hundreds of variants
that each have a relatively small effect on a trait.
These projects, known as genome-wide association
studies (GWAS), build up a picture of
which DNA letters vary from person to person
(called single nucleotide polymorphisms, or
SNPs), which variants are most common in
people with a given trait and how much of
the difference between individuals these SNP
patterns represent.

[Photo caption: In former UK coal-mining areas, genetic variants are linked with spending less time in school.]

Adding up the contributions made by all 
these spots on the genome gives researchers 
a measure of the importance of genetics to a
trait, known as a polygenic score. For height, 
which is known to have a strong genetic influ- 
ence, GWAS show that variants can together 
account for 20% of the variation. 
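
As a rough illustration of the adding-up step: in the usual formulation, an individual's polygenic score is the sum, over the variants identified by a GWAS, of the number of trait-associated alleles the person carries (0, 1 or 2) multiplied by that variant's estimated effect size. The SNP names and effect sizes below are invented for the example and do not come from any study.

```python
def polygenic_score(allele_counts, effect_sizes):
    """Weighted sum of allele counts, using per-SNP effect sizes as weights.

    allele_counts: SNP ID -> copies of the trait-associated allele (0, 1 or 2).
    effect_sizes:  SNP ID -> estimated effect of one copy on the trait.
    SNPs missing from effect_sizes are ignored.
    """
    return sum(
        count * effect_sizes[snp]
        for snp, count in allele_counts.items()
        if snp in effect_sizes
    )


# Hypothetical effect sizes, as a GWAS might report them.
effect_sizes = {"snp_a": 0.02, "snp_b": -0.01, "snp_c": 0.03}

# Hypothetical genotype for one person.
person = {"snp_a": 2, "snp_b": 1, "snp_c": 0, "snp_unlisted": 2}

score = polygenic_score(person, effect_sizes)
print(f"Polygenic score: {score:.3f}")
```

Real scores sum over hundreds or thousands of variants, each contributing a tiny amount, which is why large samples are needed to estimate the weights.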

As studies into physiology and disease piled 
up, scientists began to wonder whether the 
methods would work on social and psycho- 
logical attributes. 

For some complex traits, such as social 
isolation, researchers have found only a weak 
influence; one study noted that heritability for
that trait hovers at 4%. But for others, the sig- 
nal from genetics studies has blossomed from 
initially feeble to surprisingly strong. In 2013, a 
large group of researchers working under the 
umbrella name The Social Science Genetic 
Association Consortium (SSGAC) reported the
first GWAS of educational attainment, defined
as years of schooling. The study found three 
SNPs that together could explain a meagre 2% 
of the variation in years of education. But then, 
a 2016 repeat by the same consortium using a 
sample that included almost 300,000 people — 
more than double the number in the 2013 study 
— found 74 SNPs that could explain 3.2% of the 
variation. When the consortium combined
data from 1.1 million people, they discovered 
more than 1,200 SNPs that together accounted 
for 11–13% of the variation. That means the
genes for educational attainment can explain 
about as much variation in a child’s time in education
as their family’s socio-economic status
can. “I think that’s really quite remarkable,” says 
Tim Morris, an epidemiologist at the University 
of Bristol, UK. 

Beyond education, researchers have 
examined other socially shaped traits. In 2016, 
for instance, the SSGAC published a GWAS of 
almost 300,000 people and identified 3 SNPs 
associated with self-reported measures of 
well-being. And in 2017, a weak genetic signature
for antisocial behaviour showed up in a
GWAS of a group of 6,200 Finnish prisoners.
Neither study produced a polygenic score, but 
researchers expect scores for these traits will 
emerge as sample sizes continue to grow. 

The growing power of GWAS inspired
Abdellaoui to ask a different question: how
do social traits such as educational attainment
vary across a country? To find out, he and his 
team dug into the UK Biobank data set, which 
holds blood and tissue samples and survey 
responses for almost 450,000 people and 
cross-references the information to medical 
data such as hospital admissions. 

The team looked at previous studies to amass 
a list of 33 health and behavioural traits and the
genetic variants that influence them, adding up 
the contribution of each variant to get a poly- 
genic score. The researchers then investigated 
the UK Biobank samples to see whether these 
genotypes differed across the United King- 
dom. They first discounted genetic variation 
caused by historical regional differences in 
ancestry, throwing out variants that are com- 
mon because of shared ancestry rather than 
because they govern a trait. Then they could see
which traits still clustered into certain regions. 
For some traits — caffeine consumption, for 
example — there was no regional difference. 
But for others, such as educational attainment,
the difference was significant. The researchers 
found that people living in former coal-mining 
regions had, on average, fewer genetic variants 
that correlated with staying in school longer or 
with going on to higher education.

Peter Visscher, a geneticist at the University 
of Queensland in Australia who worked on the
study, says it’s not clear what underlying biol- 
ogy the genetic patterns identified represent. 
“I see that as a proxy for genes to do with intelligence
and maybe perseverance, and maybe
a bit of risk-taking.” 

Abdellaoui stresses that what they have 
produced is more description than explanation.
“There are a whole bunch of variables that
are clustering in the lower economic areas, but 
it’s very difficult to say anything about direc- 
tions of causality.” 

The researchers think the regional difference 
is down to the migration of more-educated 
people to richer areas that offer them jobs, 
leaving behind people who have genetic sig- 
natures linked to spending less time in school. 
This social stratification could become more 
marked over time, they say. “If that goes on for 
multiple generations, then for the sort of social 
inequalities already there, you run the risk of 
increasing those inequalities on a biological 
level,” says Abdellaoui. 

The researchers found the same geographic 
pattern for other traits, but the relationships 
were weaker. Genotypes known to be strongly 
associated with lower socio-economic status 
and lower cognitive ability were found more 
often in the poorer areas. These genotypes, 
the scientists reported, were associated with 
people’s political views. Those in coal-mining 
areas had more genetic variants linked to lower 
socio-economic status, and were also more 
likely to vote for the left-wing Labour party 
or the right-wing UK Independence Party.
Individuals were also more likely to have voted
for the United Kingdom to leave the EU in the 
Brexit referendum. Abdellaoui says this does 
not mean that someone is genetically predisposed
to vote in a certain way.

Other researchers in the field agree with this 
caution. “Overall I like the paper and think that 
they’ve done a good job with it,” says Morris. 
“My main fear is that these results will be 
over-interpreted. They are informative descrip- 
tive statistics, but descriptive nonetheless.” He 
also notes the UK Biobank data are “extremely 
selective” and not likely to fully represent the 
populations of the former coal-mining regions. 
“For the regional results, these really must be 
interpreted with care.” 

The results of this kind of study are based 
on associations, and must be presented very 
carefully to prevent suggestions that a person’s 
genes determine their outcomes, says Daniel 
Benjamin, a behavioural economist at the Uni- 
versity of Southern California in Los Angeles. 
He is wary of comparisons between his field 
and the spectre of eugenics, an idea from the 
beginning of the twentieth century that people 
seen as having ‘inferior’ genes should be pre- 
vented from having children. “Those of us who 
do work in this area have an ethical obligation,
and that ethical obligation is even stronger in 
the case of the genetics of behaviour because 
of past terrible misinterpretations and horrible 
consequences,” he says. 

One of the biggest sources of confusion is 
what a polygenic score actually shows about 
the contributions of nature and nurture, Ben- 
jamin says. “People have a really hard time 
understanding that genes don’t determine 
behaviour.” 

Abdellaoui says of his UK study: “We are in 
no way suggesting the genes are the sole deter- 
minant of someone’s educational outcome. It’s 
a combination of environmental and genetic 
effects.” 


Genetics in the classroom 


Another disclaimer is that polygenic scores 
represent the ‘risk’ of having a particular trait, 
and don’t necessarily suggest that genetics 
is a major factor in behaviour. For instance, 
the scores cannot foretell that one individual 
will definitely graduate from university and 
another will quit school aged 16. “I don’t think 
that polygenic scores are at the level of predictive
ability that would allow you to make those
kinds of individual judgements with any degree 
of certainty,” says Paige Harden, a psychologist
at the University of Texas at Austin. 

When Benjamin and his team put together
the most recent GWAS on education, they
released an accompanying 20-page list of frequently
asked questions to explain the study’s
motives, which made clear that the scientists 
thought there were no implications for educa- 
tion policy. Not everyone is so cautious, says 
Morris. “There are quite a few academic papers
coming out that can’t resist a final sentence
right at the end, along the lines of ‘the DNA 
revolution is coming and genes will soon be 
useful for predicting education’, which I think 
is quite irresponsible,” he says. He wants such 
papers to include more context — for example, 
pointing out that existing information such as
a student’s previous attainment can already 
do a better job of predicting their future 
performance than a polygenic score can. 

A working group announced earlier this 
month by bioethics think tank The Hastings 
Center in Garrison, New York, plans to examine 
the field and advise researchers and stakeholders
on how to conduct and talk about the work
(see go.nature.com/2vtbpey). 

But others are less guarded. They argue that 
genetic screens of behaviour and cognitive
ability could help children as young as three 
to fare better at school. “It can’t be right for 
education to continue to ignore genetic 
influence, because it’s far and away the most 
important source of individual differences,” 
says Robert Plomin, a psychologist at King’s 
College London, who is one of the more bullish
voices in the debate and whose interpretations 
of the studies are controversial. 

Sabatello, the bioethicist, predicts that the 
first applications will be in specialist education, 
such as for cases in which the parents of chil- 
dren with conditions such as attention deficit 
hyperactivity disorder (ADHD), autism spec- 
trum disorder or dyslexia could use genotypes 
as evidence to demand a different approach 
for their child. “Parents want the genomic 
information to persuade authorities or educa- 
tional entities that their kids need the specialist 
intervention.” 

At the moment, there are no reliable 
polygenic scores to assess the contribution 
of genes to these conditions, but large-scale 
studies, more powerful than those done before, 
including a major GWAS currently under way 
for ADHD, could produce them in the future. 

Although the focus on identifying and 
helping children with extra educational needs 
might sound altruistic, it, too, has a troubling 
historical precedent. Intelligence tests, which 
were first developed at the beginning of the 
twentieth century to pick out children who 
could benefit from extra attention, quickly 
became used to reinforce discrimination 
against minority populations or institution- 
alize children deemed to be ‘feeble-minded’. 

“Many teachers are worried that trying to use 
genetics as a tool in education could potentially
be misused to validate race and class-based
differences,” says Daphne Martschenko, who
has just finished a PhD at the University of 
Cambridge, UK, that investigated attitudes in 
education to genetics. 

In fact, because GWAS are done mostly using
data from people of European ancestry, this 
could make the results less applicable for differ- 
ent ethnic groups. “A real pragmatic challenge 
is that we don’t have good genetic indicators 
for children of colour,” Harden says. 

Morris thinks that this could compound 
existing inequality in education. “If you can’t 
do something for everyone in the system, then 
you can’t do it.” 


Responsible research 


Many in the field agree that the most useful 
application of these results will be to allow bet- 
ter-quality research into environmental — not 
genetic — influences on complex behavioural 
traits, by taking out the influence of genetics 
while studying some other factor. “It’s an unsexy 
thing to talk about,” says Harden, “but a better 
idea is using genetics as a control variable to work 
out what actually works to improve learning.” 

Researchers could include children with 
similar polygenic scores in both the control 
and test groups when trialling an intervention, 
for instance. 

The results could also help scientists to 
probe whether the effects of genetics depend 
on an individual’s environment — whether 
certain gene variants kick in only under some 
circumstances. And more-sophisticated 
genetic studies could unpick the importance 
of something called genetic nurture, in which 
environmental influences are misidentified as 
genetic. This could be the case with education, 
because well-educated parents both pass on 
their genes and are more likely to contribute 
indirectly by encouraging their children’s 
schooling. 

The priority for most researchers in this field 
is to do more and bigger studies, to produce 
ever-stronger signals and tackle different 
traits such as income and social withdrawal. 
Meanwhile, those at the educational coalface 
don’t need insight from genetics to improve 
outcomes, says Sabatello. “We need to look 
at the environment. Children who are hungry 
can’t study. We don’t need to have their genes 
for that.” 


David Adam is a freelance journalist based 
near London. 

Double deception in the asylum? 


Susannah Cahalan’s investigation of the social-psychology experiment that saw 
healthy people sent to mental hospitals finds inconsistencies. By Alison Abbott 


From 1969 to 1972, an extraordinary 
experiment played out in 12 psychiatric 
institutions across 5 US states. Eight 
healthy people, including David Rosenhan, 
a social psychologist at Stanford 
University in California who ran the experiment, 
convinced psychiatrists that they 
needed to be committed to mental hospitals. 
The ensuing paper, published in Science in 1973, 
opens with the words: “If sanity and insanity 
exist, how shall we know them?” It claimed that 
the psychiatric establishment was unable to 
distinguish between the two. 

Rosenhan’s study had far-reaching and 
much-needed effects on psychiatric care in 
the United States and elsewhere. By the 1980s, 
most psychology textbooks were quoting it. 
It also influenced society more widely, and 
not always positively: in the law courts, for 
instance, it undermined the value of expert 
testimonies from psychiatrists. Now, in The 
Great Pretender, journalist Susannah Cahalan 
turns a fresh, critical eye on the experiment 
and the shockwaves it sent through the field 
and beyond. 

Cahalan quotes a former colleague of 
Rosenhan’s, who notes that he was a good 
networker, an excellent lecturer and a gener- 
ally charismatic character. “But some people 
in the department called him a bullshitter,” 
Kenneth Gergen says. And through her deeply 
researched study, Cahalan seems inclined to 
agree with them. She discovered that the man 
whom she had initially admired, and who had 
done so much to change how mental illness 
was perceived, was not all that he had seemed. 
And neither, she argues, was his famous 
experiment. 

Cahalan began her investigation into 
Rosenhan’s experiment in good faith. Ten years 
ago, she developed paranoia, hallucinations 
and, eventually, seizures. She was dosed with 
antipsychotics before being correctly diag- 
nosed with a very rare type of autoimmune 
encephalitis, an ordeal she describes in her 
first book, Brain On Fire. 

After it was published in 2012, a casual conversation 
with McLean Hospital psychiatrists 
in Boston, Massachusetts, alerted Cahalan 
to Rosenhan’s experiment. She immediately 
wanted to know more about the experiences 
of those who volunteered, and the challenges 
that such a risk-laden experiment would have 
posed decades ago. 

Image caption: David Rosenhan and his volunteers feigned symptoms to be admitted to psychiatric hospitals. 

Rosenhan was not the first to infiltrate a 
psychiatric hospital and report on condi- 
tions. Cahalan tells, for example, of the nine- 
teenth-century journalist Nellie Bly, who 
deceived doctors to spend ten days in an 
overcrowded women’s asylum on Blackwell’s 
Island, New York. Bly’s reports of the appalling 
conditions there shamed politicians into 
increasing the asylum’s budget. 

But Rosenhan was the first to carry out a 
formal experiment involving a number of 
“pseudopatients”. All eight, including Rosen- 
han, reported the same symptom to different 
doctors: that they heard voices uttering “thud, 
empty, hollow”, denoting existential doom. 
Seven were diagnosed with schizophrenia; 
one with manic depression. Once admitted to 
hospital, the volunteers stopped simulating 
symptoms of abnormality. Rosenhan noted 
in the Science paper that genuine patients 
often realized that the pseudopatients did 
not have a mental-health disorder, and accused 
them of being undercover journalists or aca- 
demics checking up on the hospital. Psychi- 
atrists seemed less perceptive: it was several 
weeks before some of the pseudopatients 
got discharged. 

Although Rosenhan died in 2012, Cahalan 
easily tracked down his archives, held by social 
psychologist Lee Ross, his friend and colleague 
at Stanford. They included the first 200 pages 
of Rosenhan’s unfinished draft of a book about 
the experiment. 

At first, it seemed that Cahalan’s research 
was going to be easy, even though Rosenhan 
had given fictitious names to the pseudopatients 
she wished to track down, along with the 
hospitals they went to. Ross warned her that 
Rosenhan had been secretive. As her attempts 
to identify the pseudonymous pseudopatients 
hit one dead end after the other, she realized 
Ross’s prescience. 

The archives did allow Cahalan to piece 
together the beginnings of the experiment in 
1969, when Rosenhan was teaching psychol- 
ogy at Swarthmore College in Pennsylvania. 
The students complained that the course was 
too abstract, so Rosenhan suggested that they 
check into a psychiatric hospital to get to know 
people with schizophrenia personally. The 
superintendent of the local Haverford State 
Hospital was willing to take them on, but Rosen- 
han cautiously decided to check things out for 
himself first. He emerged humbled from nine 
traumatizing days in a locked ward, and aban- 
doned the idea of putting students through 
the experience. But it set him thinking about 
a scientific experiment aimed at exposing the 
system’s travesties. 

According to Rosenhan’s draft, it was at a 
conference dinner that he met his first recruits: 
a recently retired psychiatrist and his psychologist 
wife. The psychiatrist’s sister also signed 
up. But the draft didn’t explain how, when and 
why subsequent recruits signed up. 

Cahalan interviewed numerous people who 
had known Rosenhan personally or indirectly. 
She also chased down the medical records of 
individuals whom she suspected could have 
been involved in the experiment, and spoke 
with their families and friends. But her sleuthing 
brought her to only one participant, a 
former Stanford graduate student called Bill 
Underwood. 

The Great Pretender: The Undercover Mission that Changed our Understanding of Madness 
Susannah Cahalan 
Grand Central Publishing (2019) 


Underwood and his wife were happy to talk, 
but two of their comments jarred. Rosenhan’s 
draft described how he prepared his volunteers 
very carefully, over weeks. Underwood, how- 
ever, remembered only brief guidance on how 
to avoid swallowing medication by hiding pills 
in his cheek. His wife recalled Rosenhan telling 
her that he had prepared writs of habeas corpus 
for each pseudopatient, in case an institution 
would not discharge them. But Cahalan had 
already worked out that that wasn’t so. 

Comparing the Science report with docu- 
ments in Rosenhan’s archives, she also noted 
many mismatches in numbers. For instance, 
Rosenhan’s draft, and the Science paper, stated 
that Underwood had spent seven days in a hospital 
with 8,000 patients, whereas he spent 
eight days in a hospital with 1,500 patients. 

When all of the leads from her contacts came 
to nothing, she published a commentary in 
The Lancet Psychiatry asking for help in finding 
the pseudopatients, to no avail. Had Rosenhan invented 
them, she found herself asking? 

In recent years, other heroes of social 
psychology have been found to have misrep- 
resented their data. The most prominent case 
is that of Dutch social psychologist Diederik 
Stapel, who was forced to retract 58 papers. 
Those who have followed these cases might 
be appalled by the Rosenhan story, but will not 
be surprised. 

Cahalan, whose life was saved by front-line 
medical science in the context of psychiatry, 
was shocked by what she found. She writes 
that she cannot be completely certain that 
Rosenhan cheated. But she is confident enough 
to call her engrossing, dismaying book The 
Great Pretender. 


Alison Abbott writes from Munich, Germany. 
e-mail: alison.abbott.consultant@springernature.com 


The rise of the 
greedy-brained ape 


Gaia Vince takes an enjoyable sprint through 
human evolution. By Tim Radford 


Gaze into a mirror. Reflected is a marvel 
of evolution: a weak-jawed, bipedal 
omnivore with a greedy brain, in which 
100 billion neurons consume 20% of 
the body’s energy intake. Science 
journalist Gaia Vince urges us towards such 
reflections in Transcendence, a book tracing 
the journey of Homo sapiens through genes, 
environment and culture to what might be, she 
surmises, a new state of being. 

For her hugely enjoyable sprint through 
human evolutionary history, Vince (erstwhile 
news editor of this journal) intertwines many 
threads: language and writing; the command 
of tools, pursuit of beauty and appetite for trin- 
kets; and the urge to build things, awareness 
of time and pursuit of reason. She tracks the 
cultural explosion, triggered by technologi- 
cal discovery, that gathered pace with the first 
trade in obsidian blades in East Africa at least 
320,000 years ago. That has climaxed this 
century with the capacity to exploit 40% of the 
planet’s total primary production. 

How did we do it? Vince examines, for 
instance, our access to and use of energy. Other 
primates must chew for five hours a day to survive. 
Humans do so for no more than an hour. 
We are active 16 hours a day, a tranche during 
which other mammals sleep. We learn by blind 
variation and selective retention. Vince proposes 
that our ancestors enhanced that process 
of learning from each other with the command 
of fire: it is 10 times more efficient to eat cooked 
meat than raw, and heat releases 50% of all the 
carbohydrates in cereals and tubers. 

Image caption: Shilluk people in Sudan gather in the shade for traditional storytelling. 

Thus Homo sapiens secured survival and 
achieved dominance by exploiting extra 
energy. The roughly 2,000 calories ideally 
consumed by one human each day generates 
about 90 watts: enough energy for one incan- 
descent light bulb. At the flick of a switch or 
turn of a key, the average human now has access 
to roughly 2,300 watts of energy from the hard- 
ware that powers our lives — and the richest 
have much more. 

Transcendence: How Humans Evolved through Fire, Language, Beauty, and Time 
Gaia Vince 
Allen Lane (2019) 

Humans are more social than other primates. 
We can keep track of around 150 other people, 
which demands a large brain and might also 
help to expand it. To learn a fact stimulates one 
part of the brain; to hear a story activates many. 
That is why we find information 22 times more 
memorable in narrative form. Homo sapiens 
is a storytelling animal, and this adaptation 
ensures the transmission of skills and knowl- 
edge as fable, epic or cautionary tale. Vince, 
drawing on brain-scan studies, shows that 
neuroscientists have noted a synchrony, both 
spatial and temporal, between speaker and 
listener during storytelling, a phenomenon 
known as ‘neural coupling’. 

The human capacity for narrative, metaphor 
and pattern-matching can lead us to see 
meaning where there is none, however. In a 
US psychological experiment in 1944, students 
were shown a short animation of two triangles 
and a circle passing across a screen, while a 
rectangle remained stationary (F. Heider & 
M. Simmel Am. J. Psychol. 57, 243-259; 1944). 
Of the subjects, 33 out of 34 anthropomor- 
phized the moving shapes, creating narratives 
of anxiety, concern, rage and frustration. 

Vince continually returns to the evolutionary 
triad of genetics, environment and culture 
to address our similarities and differences. 
Some human biological adaptations are part 
of cultural variety. The semi-nomadic Moken 
people of Thailand can see clearly underwater 
because they can constrict their pupils to the 
maximum limit of human capacity, increasing 
depth of field and changing the lens shape. 
This is a learnt capacity: in an experiment, 
Swedish children mastered it. Divers of the 
Bajau people in Indonesia, however, exemplify 
heritability and environmental selection at 
work. Their spleens are 50% larger than aver- 
age, acting as a reservoir of oxygenated blood 
and endowing them with consummate endur- 
ance underwater. 

Our most profound cultural tool, language, 
is in some ways culturally selected. We owe 
our acrobatic way with words to a larynx that 
descends at three months of age. Thereafter, 
Vince notes, we can no longer swallow and 
breathe at the same time. Our languages shape 
our thinking and cultural identity in many 
ways, but environment also shapes speech. 
Languages in warm, wet, wooded regions tend 
to have more vowels and fewer consonants. 
Languages that emerged at altitude have 
more words with a strong expulsion of air in 
the consonants. 

Tonality in languages (in which a word has 
different tones that change meaning) is impor- 
tant. The emergence of non-tonal languages 
over the past 50,000 years — Homer’s Greek 
was tonal, modern Greek is not — might have 
influenced the spread of two gene variants 
involved in brain growth, according to a 2007 
study (D. Dediu and D. R. Ladd Proc. Natl Acad. 
Sci. USA 104, 10944-10949; 2007). So words 
also shape our inheritance. 

Vince has a lot to say about words. The 
average response rate between speakers dur- 
ing a conversation is 200 milliseconds. But it 
takes 600 milliseconds for the signal to go from 
ears to brain, to understanding, to the preparation 
of a response and its transmission. Thus, 
conversation must rely on a sophisticated prediction 
system that commits a large part of the 
brain to both speaking and listening. Language, 
writes Vince, “gives us an unparalleled ability 
to convey an infinity of ideas. We use it mainly 
to talk about ourselves.” 

Of course we do. Humans might not be so 
much Homo sapiens as Homo narcissus, the 
self-absorbed species. Yet all of our capac- 
ities together have, in their different ways, 
endowed us with the capacity to become a 
super-organism. We are now a globally con- 
nected urban species, outsourcing our brains 
to computers, increasingly to artificial intel- 
ligence and (so far) to nine billion robots. 
We have begun the Anthropocene, and our 
demands on the planet are not sustainable. 
That could usher in a new dark age, or a global 
order in a new shared civilization. We transcend 
our evolutionary beginnings. 

Vince dubs this emerging species Homo 
omnis, or Homni for short. Her chosen ana- 
logue for such a biological super-organism is 
not flattering: it is the slime mould, in which 
single cells coalesce as one to move on. The fortunate 
are protected at the centre; those on the 
margin become vulnerable to environmental 
change. Which sounds disturbingly like us. 

Many aspects of Transcendence have been 
explored before. And, with that wealth of 
palaeoanthropological and other research 
to draw from, most of the chapters become a 
mosaic of tersely introduced evidence. Read it 
anyway. It is at least 22 times more memorable 
than many textbooks, and a good story without 
— so far — a happy ending. 


Tim Radford is a former science editor of 
The Guardian. His book The Consolations of 
Physics is published by Sceptre. 

e-mail: radford.tim@gmail.com 


Books in brief 


Inventing Tomorrow 

Sarah Cole Columbia University Press (2019) 

H. G. Wells was, asserts scholar Sarah Cole, a pioneer adept at 
“rescaling the cosmos and humanity’s place in it”. He straddled the 
border between science and literature, but not all his complexities 
were benign: he both repudiated racism and for some time shamefully 
subscribed to ideas on eugenics. Cole adroitly captures Wells, from his 
mould-breaking books (such as the 1895 science-fiction classic The 
Time Machine and 1920 Outline of History) to his unlikely intellectual 
kinship with subtle modernists such as Virginia Woolf. 

Scatterbrain 

Henning Beck Greystone (2019) 

The human brain at work, notes neuroscientist Henning Beck, is 
sloppy — and that is precisely what makes us creative powerhouses. 
Beck’s coolly amusing narrative takes us through forgetting, pigeon- 
holing, distraction and deep into creativity. He explores how idle 
wool-gathering is more conducive to creativity than is ‘efficient’ 
thinking, and the uncannily similar way in which true and false 
memories are generated in the brain. His is a hopeful message, 
ultimately. If we don’t err, we don’t change. So: “stay fallible”. 

The Queens of Animation 

Nathalia Holt Little, Brown (2019) 

The early hand-drawn animations of Walt Disney Studios remain a 
technological wonder. Few know, however, of the company’s female 
virtuosi, who from the 1930s on injected nuance into characters from 
Bambi to a panoply of princesses. In her gripping corrective, Nathalia 
Holt ushers these animators and story developers into the limelight: 
Bianca Majolie, Sylvia Holland, Retta Scott, Grace Huntington and 
Mary Blair. Particularly in the early years, Holt shows, they paid a high 
price to work, forced to battle harassment in mostly male teams. 

Rebel Star 

Colin Stuart Michael O’Mara (2019) 

This compelling portrait of the Sun packs in facts while speculating 
on gaps in our knowledge. Astronomy journalist Colin Stuart traces 
the arc of discovery from the fourth-century BC heliocentricism of 
Aristarchus of Samos through solar spectroscopy, star formation and 
nuclear fusion, the “epic journey” of sunlight to Earth and more. The 
Sun is both bountiful and belligerent, he reminds us. Solar power could 
make 87% of countries energy self-sufficient — but the next big solar 
storm could send our electrical infrastructure into meltdown. 

The First Cell 

Azra Raza Basic Books (2019) 

Each year, the United States spends US$150 billion on treating 
cancer. Yet as oncologist Azra Raza notes in this incisive critique-cum-memoir, 
the treatments remain largely the same. Raza wants 
to see change: eliminating the first cancer cell rather than “chasing 
after the last”, which is doable with current technologies. Meanwhile, 
she braids often-harrowing stories of patients, including her own 
husband, with insights gleaned from laboratory and literature on this 
complex, often confounding array of diseases. Barbara Kiser 

Setting the agenda in research 

Ethical research: the long and 
bumpy road from shirked to shared 


Sarah Franklin 


From all too scarce, to 
professionalized, the ethics of 
research is now everybody's 
business, argues Sarah 
Franklin in the sixth essay in 
a series on how the past 150 
years have shaped science. 


Anniversary collection: go.nature.com/nature150 


In the autumn of 1869, Charles Darwin was 
hard at work revising the fifth edition of On 
The Origin of Species and drafting his next 
book, The Descent of Man, to be published 
in 1871. As he finished chapters, Darwin 
sent them to his daughter, Henrietta, to edit, 
hoping she could help to head off the hostile 
responses to his debut, including objections 
to the implication that morality and ethics 
could have no basis in nature, because nature 
had no purpose. 

That same year, Darwin’s cousin Francis 
Galton published Hereditary Genius, a book that 
recast natural selection as a question of social 
planning. Galton argued that human abilities 
were differentially inherited, and introduced a 
statistical methodology to aid “improvement 
of the race”. Later, he coined the term ‘eugenics’ 
to advocate selective reproduction through 
application of the breeder’s guiding hand. 

Darwin’s transformative theory inspired 
modern biology; Galton’s attempt to equate 
selection and social reform spawned eugenics. 
The ethical dilemmas engendered by these two 
late-nineteenth-century visions of biological 
control proliferate still. And, as older quanda- 
ries die out, they are replaced by more vigor- 
ous descendants. That there has never been 
a border between ethics and biology remains 
as apparent today as it was 150 years ago. The 
difference is that many of the issues, such as 
the remodelling of future generations or the 
surveillance of personal data, have become as 
everyday as they are vast in their implications. 
To work out how to move forward, it is worth 
looking at how we got here. 

Image caption: Embryologist Ian Wilmut encounters his brainchild, Dolly the cloned sheep, which is now stuffed and went on show at an exhibition in 2015. 


In the late nineteenth century, like today, 
society was in upheaval and science was on 
a roll. With Darwin’s bold hypotheses set 
before them, Victorian breeders, microsco- 
pists, collectors, astronomers, geologists 
and anatomists sought to discover the laws 
interconnecting life’s core processes — often 
by using ingenious experimental designs. To 
probe the formative effects of gestation on 
heredity in mammals, the gentleman naturalist 
Walter Heape, a laboratory demonstrator at the 
University of Cambridge, UK, conducted the 
first experiments in transferring embryos from 
one variety of rabbit to another at his home in 
Prestwich in the 1890s. His methods typified a 
new era of disrupt-and-learn biology. 


Biology rebooted 


By the early part of the twentieth century, what 
had come to dominate was “the biological 
gaze”, to quote historian Evelyn Fox Keller at 
the Massachusetts Institute of Technology in 
Cambridge. Rather than simply observing life, 
experimenters began to manipulate its com- 
ponent parts to test the limits of the system, 
mix up ingredients and turn biology inside out. 

In 1903, the embryologist Hans Spemann 
conducted his famous experiments with 
amphibians. Using one of his infant daughter’s 
fine, elastic hairs, he tied a loop around a 
fertilized salamander egg to create an animal 
with two heads and one tail. That same decade, 
in the United States, physiologist Jacques Loeb 
pursued a new engineering biology, trying out 
all sorts of chemicals and conditions to prompt 
development in model organisms such as sea 
urchins. 


“The bar for proper scrutiny 
has not so much been 
lowered as sawn to pieces.” 


Ian Wilmut, who led the team that created 
Dolly the cloned sheep in the 1990s (at the Ros- 
lin Institute near Edinburgh, UK), once stated 
that it was Dolly’s birth that ushered in “the age 
of biological control” — and made obsolete the 
expression “biologically impossible”. In fact, 
this view of life was born at least a century ear- 
lier. And as confident experimentation turned 
ever more closely and deliberately towards 
humans, the relationships between research, 
industry and governments became a tangled 
ethical bank, and have remained so ever since. 

Eugenics, never without its trenchant opponents, 
became an increasingly crucial part of a 
new world order over the course of the twen- 
tieth century. It is particularly associated with 
the mass-sterilization campaigns that began 
after Indiana’s 1907 act, and with the Nazi 
racial-hygiene programme that reached its 
nadir in the Holocaust. 

Another legacy of the eugenics movement 
is the management of populations using 
techniques such as demography, racial clas- 
sification and statistical modelling. These, 
combined with family planning, became 
synonymous with modernity and progress. 
From Latin America and Scandinavia to India, 
China and the Soviet Union, eugenics took 
root in projects to ‘improve the population’ 
throughout the twentieth century. Eugenic 
presumptions about the differential fitness 
of native and immigrant populations were 
central to colonial administrations across the 
British Empire. Census-takers created ‘races’ 
and ‘tribes’ where none existed, for the purpose 
of managing populations more ‘scientifically’. 
These categorizations got inked into emerging 
nations across Africa and southeast Asia, and 
continue to shape definitions of race in coun- 
tries including Malaysia and Singapore today. 

The logic of the modern nation state is in no 
small part provided by eugenic techniques of 
classifying and controlling citizens, as pointed 
out by historians Alison Bashford, now at the 
University of New South Wales in Sydney, Aus- 
tralia, and Philippa Levine, at the University of 
Texas at Austin. This typological approach to 
administration was normalized through what 
has been called the “prism of heritability” by the 
sociologist Troy Duster, now at the University 
of California, Berkeley. That had the effect of 
linking together the pathologization of mental 
illness, homosexuality, criminality, poverty, 
ethnicity and race into a discourse of ‘rational’ 
management that became mainstream. 

In other words, the principles of the eugenics 
movement are part of contemporary society's 
DNA. Across national and global policies affect- 
ing everything from health care, fertility and 
incarceration to border control, education and 
regional development, the goal of shaping the 
population through selective pressures — such 
as creating a “hostile environment” for immi- 
grants — is alive and well. 


The rise of bioethics 


The birth of bioethics in the 1970s was in no 
small part a response to harmful research 
projects undertaken within this context — 
on vulnerable groups such as immigrants, 
prisoners and psychiatric patients — and with- 
out meaningful consent. The field emerged 
largely in the United States, partly driven by 
the international outrage at the exposure in 
1972 of the covert US Public Health Service 
research project at Tuskegee University — in 
which more than 400 black US men, mostly 
poor share-croppers from Alabama, were sub- 
jected to untreated syphilis between 1932 and 
1972. As many as half of them died, and 60 of 
their wives and children contracted the disease. 

In 1974, the US government passed the 
National Research Act and established the 
National Commission for the Protection of 
Human Subjects of Biomedical and Behavioral 
Research. Two years later, the commission 
drafted a report outlining in detail the “basic 
ethical principles and guidelines that should 
assist in resolving the ethical problems that 
surround the conduct of research with human 
subjects”. 

In 1978, this was published as The Belmont 
Report’ in the US Federal Register, establish- 
ing guidance for national research and the 
three pillars of modern bioethics. These were: 
respect for persons, beneficence (‘doing good’) 
and justice. The report also clarified the basis 
for informed consent of study participants, and 
helped to enforce mandatory policies for ethi- 
cal oversight of research. The three principles 
were largely aimed at preventing the mistreat- 
ment of vulnerable individuals and communi- 
ties. Under Belmont’s influence, research ethics 
became a central principle of modern science. 

Bioethics flourished throughout the 1980s, 
expanding to include equity in public health 
and access to medical care. The field became 
increasingly central to medical and scientific 
training, as well as to research funding. That 
focus was intensified by the ‘too little, too late’ 
critiques of government responses to the HIV 
crisis that emerged in the mid-1980s. 

Bioethics gathered momentum at this 
time by offering guidance on controversial 


biomedical applications such as organ 
transplants and in vitro fertilization (IVF). In 
the first encyclopaedia of bioethics, published 
in 1978, theologian Warren T. Reich drew attention 
to a key shift in the practice of medicine: 
one that moved away from a commitment to 
preserving life. In the past, he argued, medicine 
was guided by the absolute principle to ‘do 
no harm’. However, a different ethical dilemma 
arose out of a heart transplant, a procedure that 
could significantly improve an individual’s 
quality of life but which also had the potential 
to kill them. In contrast to the iron-clad medical 
ethics of old, the absolute value of human 
life became relativized. In the world’s most 
advanced medical facilities, a higher quality of 
life could now be worth dying for. Once again, 
ethical debate was reignited. 

It was in the 1990s that professionalized 
bioethics reached its high-water mark. The 
Human Genome Project (HGP) — the leviathan 
of publicly funded DNA sequencing — prom- 
ised to unleash a combination of Darwin’s and 
Galton’s visions as the century drew to a close. 
Ethics claimed the largest share of HGP funds 
set aside for the analysis of “Ethical, Legal, and 
Social Implications” (ELSI) of genome mapping. 
In the United States alone, around US$3 billion 
(5% of the HGP budget) was spent to create “the 
world’s largest bioethics program”. Armies of 
ethicists combed over the philosophical prin- 
ciples of altering genetic material in ways that 
might or might not be passed on to future 
generations, and the perils of designer babies. 

Then, as the century neared its end, some- 
thing else took centre stage: new techniques 
derived from reproductive and developmental 
biology, such as cloning and research into stem
cells and embryos. As the prospects of quick 
bench-to-bedside applications from the HGP 
faded, so did the allure of bioethics. The disci- 
pline lost its most significant source of funding 
as ELSI programmes ceased. 

To return to 1978, there was another turning
point for bioethics in the year of The Belmont 
Report: the birth of Louise Brown, the first baby 
conceived through IVF. Some of the most con- 
troversial research and applications over the 
past half-century have concerned reproductive 
and developmental biology. But while bioeth- 
icists were recruited en masse to contemplate 
the impact of the HGP, the fertility industry 
mushroomed, generating an impressive set 
of acronyms (but no ELSI). For a while, global 
public opinion became more sharply divided 
over cloned dogs and genetically modified 
(GM) maize (corn) than GM babies. That concern
would come later.

In retrospect, many of the forces that
propelled late-twentieth-century bioethics into
the limelight — such as the focus on speculative
genomic futures — eventually left it unmoored.
In the past two decades, bioethics has drifted
into uncharted waters. Today, amid a panoply
of ethical quagmires ranging from gene-edited
babies and neurotechnology to dish-grown
organoids and nanobots, the fraught relationship
between society and research is once again
front and centre.


Beyond bewilderment 


Just as the ramifications of the birth of mod- 
ern biology were hard to delineate in the late 
nineteenth century, so there is a sense of 
ethical bewilderment today. The feeling of 
being overwhelmed is exacerbated by a lack 
of regulatory infrastructure or adequate policy 
precedents. Bioethics, once a beacon of prin- 
cipled pathways to policy, is increasingly lost, 
like Simba, in a sea of thundering wildebeest. 
Many of the ethical challenges arising from 
today’s turbocharged research culture involve 
rapidly evolving fields that are pursued by glob- 
ally competitive projects and teams, spanning 
disparate national regulatory systems and 
cultural norms. The unknown unknowns grow 
by the day. 

The bar for proper scrutiny has not so much
been lowered as sawn to pieces: dispersed,
area-specific ethical oversight now exists in
a range of forms for every acronym from AI
(artificial intelligence) to GM organisms. A
single, Belmont-style umbrella no longer seems
likely, or even feasible. Much basic science is 
privately funded and therefore secretive. And 
the mergers between machine learning and 
biological synthesis raise additional concerns. 
Instances of enduring and successful interna- 
tional regulation are rare. The stereotype of 
bureaucratic, box-ticking ethical compliance 
is no longer fit for purpose in a world of CRISPR
twins, synthetic neurons and self-driving cars. 

Bioethics evolves, as does any other branch 
of knowledge. The post-millennial trend has 
been to become more global, less canonical 
and more reflexive. The field no longer relies 
on philosophically derived mandates codified 
into textbook formulas. Instead, it functions as 
a dashboard of pragmatic instruments, and is 
less expert-driven, more interdisciplinary, less 
multipurpose and more bespoke. In the wake 
of the ‘turn to dialogue’ in science, bioethics 
often looks more like public engagement — and 
vice versa. Policymakers, polling companies 
and government quangos tasked with organ- 
izing ethical consultations on questions such 
as mitochondrial donation (‘three-parent 
embryos’, as the media would have it) now 
perform the evaluations formerly assigned 
to bioethicists. Journal editors, funding bod- 
ies, grant-review boards and policymakers are 
increasingly the new ethical adjudicators. 

These shifts have been a long time coming
and have many different sources, including
the driving influence of practical ethicists
such as the British philosopher Mary Warnock
— often to the consternation of the wider
bioethics community. After Warnock’s Report of
the Committee of Inquiry into Human Fertilis- 
ation and Embryology was published for the 
UK government in 1984, John Harris, a med- 
ical ethicist at the University of Manchester, 
UK, complained that “the crucial questions 
are fudged, or rather are never addressed”. 
He argued that Warnock’s approach was 
over-reliant on “primitive feelings”, resulting 
in recommendations that were “false” and “dan- 
gerous”. The Warnock committee, in his view, 
had evaded the single most important question 
they faced concerning the moral status of the 
human embryo, in favour of a sentimental
concession to expedient policy.

As we now know, Warnock was prescient 
in her attention to the strength of public feel- 
ing in relation to human-embryo research. 
Her reliance on several overlapping types of 
argument to justify strict limits on the intro- 
duction of new reproductive technologies 
has enabled the United Kingdom to establish 
a licensing system that is more flexible and that
has proved more long-lasting than in any other 
country. Her committee, unusually compris- 
ing a majority of non-scientists, reached its 
consensus based on a pragmatic and principled
proposal: that approval for the study of
controversial therapeutic and experimental 
procedures would be subject to a strict and 
comprehensive code of practice upheld by 
Parliament. The law itself, Warnock argued, 
would act as both a guarantor and a symbol 
of public morality; it would in its combination 
of permissive scope and legislative precision 
express “the moral idea of society”. This was a
new template for ethical reasoning.

When the UK government decided, in the 
wake of the passage of the Human Fertilisation 
and Embryology Act in 1990, that it would not 
establish a counterpart to the US National Bio- 
ethics Advisory Commission, it was following 
Warnock’s lead, and that of Anne McLaren. This 
developmental biologist and Warnock com- 
mittee member took a populist and practical 
approach to public trust in science that has 
been highly influential. 

Today, interdisciplinary expertise plus 
extensive and creative public consultation 
increasingly define a new approach to ethical 
science. This trend has been reinforced by 
organizations such as the Nuffield Council on
Bioethics, which advises the UK government by 
mobilizing a broad spectrum of knowledge, far 
beyond that of bioethicists and philosophers. 
Since 1993, the council has commissioned 
and published nearly 30 specialist reports 
on controversial biomedical issues, ranging 
from genetic screening to xenotransplanta- 
tion. Few of the panels have been chaired by 
bioethicists. Many of the reports have widened 
the idea of what counts as an ethical issue — 
for example, the exploration of cultures of UK 
scientific research, chaired by the University 
of Cambridge plant scientist Ottoline Leyser.
In a similar vein, the International Society for
Stem Cell Research has released a series of
global guidelines that prioritize oversight,
communication and research integrity as well
as patient welfare, social justice and respect 
for study participants over fixed principles of 
ethical conduct. 

In a social-media-saturated age wary of fake
news, the new holy grail is the ability to create 
trustworthy systems for governing controver- 
sial research such as chimeric embryos and 
face-recognition algorithms. The pursuit of a 
more ethical science has come to be associated 
with building trust by creating transparent pro- 
cesses, inclusive participation and openness 
to uncertainty, as opposed to distinguishing 
between ‘is’ and ‘ought’. 

In short, expert knowledge and reliable data
are essential but never enough to enable endur- 
ing, humane governance to emerge. So there 
is now more emphasis on continuous com- 
munication and outreach, and on long-term 
strategies to ensure collective participation 
and feedback at all stages of scientific inquiry. 
The result is less reliance on specialized ethical 
expertise and more attention to diversity of 
representation. 

Amid the perils and promises of applica- 
tions, from replacement heart and liver cells 
or driving malarial resistance through the 
mosquito population to ending Huntington’s
disease, a new legacy to Darwin and Galton has
emerged. It turns out that what we have in common
is less a single biological essence — or the
ability to alter it — than a shared responsibility
for human and non-human futures. The impli- 
cation of this new model is that the most ethical 
science is the most sociable one, and thus that 
scientific excellence depends on greater inclusivity.
We are better together — we must all be
ethicists now.

Pregnant women in Lima, Peru, painted each other in 2013 to help a local hospital to raise awareness of good maternity health care.


Prevent depression in pregnancy to 
boost all mental health 


Ricardo F. Muñoz


Tackling mental disorders 
before they arise in pregnant 
women and new mothers is an 
approach that could be scaled 
up online — and would aid the 
overall health of populations. 


I have been convinced of the importance of
prevention in addressing mental-health 
problems since the early 1970s, when I 
began my doctorate in clinical psychol- 
ogy. But only now is there sufficient evi- 
dence from clinical trials of the effectiveness 
of preventive interventions, using approaches 
derived from interpersonal and cognitive 
behavioural therapy, to justify deploying 
them. And only now are the tools available to 
make such interventions available to people 
worldwide. 
Two recent reports underline this 
conclusion. In February, the US Preventive 
Services Task Force, an independent panel of
experts in evidence-based medicine, urged
clinicians to “provide or refer pregnant and
postpartum persons who are at increased risk
of perinatal depression to counseling interventions”.
And last month, the US National Academies
of Sciences, Engineering, and Medicine
(NASEM) released a report calling on various
stakeholders, from educators to policymakers,
to prevent mental-health disorders and to
promote healthy mental, emotional and behavioural
development in the under 25s. (I was a
member of the committees that prepared this
document and two previous NASEM reports in
1994 and 2009 on preventive interventions.)

The latest NASEM call to action is so
all-encompassing, it is hard to know where to
begin. I propose that initial efforts focus on
preventing depression in pregnant women
or in women who have recently given birth
(perinatal depression). There is substantial
evidence for the effectiveness of providing
such women with basic skills in mood management.
These interventions could have an
impact across generations, because better
maternal mental health is linked to babies’
healthier development. And if researchers
and health-care systems were to monitor and 
compare the epidemiology of depression in 
thousands of mothers and their children in 
areas that have or have not deployed preven- 
tive interventions, stakeholders could measure 
their effect on entire communities. 

Ultimately, massive open online interven- 
tions will need to be created (similar to the 
massive open online courses that are deliv- 
ered on the Internet for free). These would 
allow anyone to obtain information and tools 
to help them stave off depression, at times and 
places that are convenient to them. 


A global challenge 

In the United States, nearly 15% of men and 
around 26% of women experience a major 
depressive episode at some point. People are
diagnosed with this if they report experiencing
five of nine symptoms over at least two weeks.
These must include either feeling depressed
or being unable to feel interest or pleasure, as
well as problems sleeping, changes in appetite,
fatigue or having suicidal thoughts.

Numerous psychological, pharmacological 
and physical treatments are effective, such as 
cognitive behavioural therapy, antidepressant 
drugs and electroconvulsive therapy. But many 
people who are depressed are not receiving 
treatment because they fear stigma, can’t get
to clinics or afford treatment, or because there 
aren’t enough psychologists and psychiatrists 
to meet their needs. 

Given these challenges — and especially given 
the scale of the problem — societies worldwide 
need to take steps to stop depression from 
taking hold in the first place. 

The number of randomized controlled trials 
testing preventive interventions has greatly 
increased since 1995 (see ‘Mounting data’). 
Two approaches have been studied the most: 
cognitive behavioural therapy and interper- 
sonal therapy. The first involves teaching 
people how to use the natural relationship 
between thoughts, behaviours and mood to 
increase those thoughts and behaviours that 
lead to healthy mood states — and to reduce or 
modify those that elicit sadness, helplessness 
and hopelessness. (People might be asked, for 
instance, to predict how their mood would 
change if they undertook certain activities, 
such as seeing a friend — and then to record 
how their mood actually changed following the 
activity.) The second approach, interpersonal 
therapy, helps people to communicate better 
with others, and so to obtain more support 
from friends and family. 

In the early 2000s in California, for example,
my colleagues and I at San Francisco General 
Hospital (now the Zuckerberg San Francisco 
General Hospital and Trauma Center) con- 
ducted a pilot study funded by a US National 
Institute of Mental Health grant. The study 
involved 41 Spanish- and English-speaking 
women, most of whom were in their 16th week 
of pregnancy. These women were not clinically 
depressed but were deemed to be at high risk 
because they scored 16 or more on a depression
scale, or had a history of major depressive epi- 
sodes. The preventive intervention we used in 
this case involved psychologists teaching a cog- 
nitive behavioural ‘Mothers and Babies/Mamas 
y Bebés’ course in 2-hour sessions once a week 
for 12 weeks. Only 14% of the women taking the 
course had a depressive episode in the following
year, compared to 25% in the control group.
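
The size of that difference can be expressed as an absolute and a relative risk reduction. The short Python sketch below only reworks the two percentages quoted above; the function name and the choice of rounding are illustrative assumptions, not part of the study's own analysis.

```python
def risk_reduction(control_rate: float, treated_rate: float) -> tuple[float, float]:
    """Return (absolute, relative) risk reduction for two incidence rates."""
    if not 0 < control_rate <= 1 or not 0 <= treated_rate <= 1:
        raise ValueError("rates must be proportions between 0 and 1")
    # Absolute reduction: difference in the share of people affected.
    absolute = control_rate - treated_rate
    # Relative reduction: that difference as a fraction of the control rate.
    relative = absolute / control_rate
    return absolute, relative


# Pilot-study figures quoted in the text: 25% (control) versus 14% (course).
absolute, relative = risk_reduction(control_rate=0.25, treated_rate=0.14)
print(f"absolute reduction: {absolute:.0%}")
print(f"relative reduction: {relative:.0%}")
```

With only 41 women in the pilot, figures derived this way are rough estimates rather than precise effect sizes.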

A meta-analysis of 32 studies in 2014 showed 
that, in all sorts of groups that are at risk — from
expectant and new mothers to individuals
who’d experienced a stroke — such preventive
interventions reduce the onset of major
depressive episodes by 21%, on average. In the
same year, my colleagues and I found that 15 of
42 randomized trials reported reductions of
50% or more in the incidence of depression.

Then, this year, the US Preventive Services
Task Force reviewed 50 randomized controlled
trials testing preventive interventions
specifically for perinatal depression. This has
shown that, on average, such interventions
reduce the incidence of major depressive
episodes by 39%. However, one interpersonal
approach, called ROSE, reduces the incidence
of episodes by 50%, and the Mothers and
Babies intervention reduces the incidence of
episodes by 53% (ref. 5).

Baby yoga in São Paulo, Brazil, which some new mothers find helps them to avoid depression.

In short, the data suggest that if we 
implement interventions that seem to be the 
most effective in clinical trials, we could halve 
the new cases of major depression. 

So why focus on expectant or new mothers? 
I propose an initial focus on perinatal depres- 
sion for four reasons. The evidence is strong. 
The window of risk is clear (during pregnancy 
and for a year after giving birth). Education and
mood-management skills could be wrapped
into the prenatal classes or home visits many
pregnant women already receive, lowering cost
and stigma — as was done in a 2010 study involving
more than 2,000 women in the Trent area
of England. Most importantly, interventions
could benefit multiple generations. A mother’s
depression is associated with lower than average
birthweight and preterm deliveries, as well
as problems in children such as impaired cognitive
development. Conversely, the healthy
development of babies and children could result
in their having healthier, planned pregnancies
when they themselves reach childbearing age.

MOUNTING DATA

The number of randomized controlled trials testing
preventive interventions for depression has soared.


Making it happen 


Of course, rolling out evidence-based preventive
interventions to millions of women at risk
of perinatal depression all over a nation, or the 
world (many of whom don't have access to pre- 
natal or postnatal care), as well as to other high- 
risk groups such as adolescents, is a daunting 
proposition. The number of therapists or 
health workers available to provide cognitive 
behavioural courses or interventions based 
on interpersonal therapy is vastly inadequate,
even when it comes to treatment. In 2013, for 
example, an estimated 43.8 million adults in the 
United States had experienced a mental illness 
in the past year, yet only 19.6 million received 
mental-health services. The World Health 
Organization estimates that, worldwide, more 
than 300 million people of all ages experience 
depression. Most do not receive treatment. A 
different strategy is required. 

In 1998, with support from the Tobacco- 
Related Disease Research Program in Oakland, 
California, my colleagues and I began build- 
ing an online resource to help people to quit 
smoking. We then conducted a randomized 
controlled trial in Spanish and English to deter- 
mine whether people’s use of the site could gen- 
erate quit rates comparable to those obtained 
from current smoking-cessation aids, such as 
the nicotine patch. (After six months, the quit 
rates of people using nicotine patches in the 
United States are 14-22% (refs 12, 13).) 

After registering on the site, people were 
able to access a guide on how to stop smoking.
They could submit their ‘quit date’ and would
then receive e-mails advising them on what 
steps to take as the date drew near. They were 
given instructions on how to manage their 
moods, and encouraged to keep diaries as 
part of the mood-management training. They 
also became part of an online community that 
offered support and information. 

Our sample consisted of 1,000 smokers from 
68 countries, 69% of whom provided follow-up 
data after one year. (If we didn’t get a response 
to our e-mails or phone calls, we assumed that 
person had resumed smoking.) In our study, 20% 
of Spanish speakers and 21% of English speakers 
quit. In other words, we ‘matched the patch’.
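
Counting every non-responder as a smoker is a deliberately conservative rule, and a small calculation shows why. The Python sketch below uses made-up counts chosen only to mirror the 69% follow-up rate; the function names and all three numbers are hypothetical and are not the trial's data.

```python
def quit_rate_missing_as_smoking(enrolled: int, confirmed_quit: int) -> float:
    """Quit rate when every non-responder is counted as still smoking."""
    if enrolled <= 0 or not 0 <= confirmed_quit <= enrolled:
        raise ValueError("need enrolled > 0 and 0 <= confirmed_quit <= enrolled")
    # The denominator is everyone enrolled, so silence counts against the site.
    return confirmed_quit / enrolled


def quit_rate_responders_only(responded: int, confirmed_quit: int) -> float:
    """Quit rate among people who answered the follow-up (less conservative)."""
    if responded <= 0 or not 0 <= confirmed_quit <= responded:
        raise ValueError("need responded > 0 and 0 <= confirmed_quit <= responded")
    return confirmed_quit / responded


# Hypothetical counts for illustration only (not taken from the study).
ENROLLED = 500
RESPONDED = 345        # 69% of those enrolled
CONFIRMED_QUIT = 100

conservative = quit_rate_missing_as_smoking(ENROLLED, CONFIRMED_QUIT)
optimistic = quit_rate_responders_only(RESPONDED, CONFIRMED_QUIT)
print(f"missing counted as smoking: {conservative:.0%}")
print(f"responders only: {optimistic:.0%}")
```

Under these assumed counts the conservative rule gives the lower of the two rates, which is why it is the safer basis for comparison with the patch.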

Instead of shutting down the website at the 
end of the grant period, we continued to run 
the trial with a donation from the Brin Wojcicki 
Foundation in San Francisco. Over the following
6 years, data from more than 34,000 smokers
from 168 countries generated similar results. I
realized then that our open online interventions
were very similar to the now-popular massive
open online courses. We had, in fact, carried
out a proof-of-concept study of a massive open
online intervention, or MOOI.

Various online interventions have already 
helped to reduce symptoms of depression.
Australia’s moodgym programme, launched in
2001, is one of the oldest global online interventions
for depression. With more than one million
registered users, it could well be the most widely
used computerized cognitive-behavioural programme
in the world. And this month, the UK
National Health Service launched an online cam- 
paign called Every Mind Matters to help people 
maintain their mental health. 

The interventions I’m calling for would be 
similar to these, but would need to be built such 
that their effectiveness could be evaluated on 
an ongoing basis. Also, effectiveness would 
need to be made transparent through continu- 
ally updated ‘box scores’ on home pages; these 
could indicate, for instance, that the interven- 
tion resulted in a substantial improvement for 
20% of 1,000 individuals. People would learn 
to look for the websites or apps that display 
effectiveness data, just as they look for services 
or films that have high ratings. 
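
One way to picture such a continually updated box score is as a running tally that is recomputed whenever a new outcome arrives. The Python sketch below is a minimal illustration under that assumption; the `BoxScore` class, its fields and the simulated outcomes are all hypothetical, and a real system would also need an agreed definition of "substantial improvement".

```python
from dataclasses import dataclass


@dataclass
class BoxScore:
    """Running tally of outcomes for one intervention (hypothetical structure)."""
    participants: int = 0
    improved: int = 0

    def record(self, improved: bool) -> None:
        """Add one participant's outcome to the tally."""
        self.participants += 1
        if improved:
            self.improved += 1

    def summary(self) -> str:
        """Text suitable for display on a home page."""
        if self.participants == 0:
            return "No outcome data yet"
        share = self.improved / self.participants
        return (f"Substantial improvement for {share:.0%} "
                f"of {self.participants:,} individuals")


score = BoxScore()
for i in range(1000):
    # Made-up outcomes: one participant in five improves.
    score.record(improved=(i % 5 == 0))
print(score.summary())
```

Because the summary is derived from the tally each time it is requested, the displayed figure stays in step with the data without any separate update step.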

MOOIs (websites, apps, text-based 
interventions, and so on) could be provided at 
no charge to every expectant or new mother in 
the world — as well as to other groups at risk, such
as adolescents, people who have lost a loved one
or those who are experiencing physical-health 
problems. In communities where few people 
have access to the Internet, health clinics could 
provide resource rooms where people could 
access MOOIs. And in remote locations where 
there are no clinics, local providers could use 
tablets, laptops or mobile phones to share 
MOOIs with the people they serve. In fact, the 
Mothers and Babies course is already being 
implemented in Tanzania and Kenya. 

Some might argue that large-scale
implementation is premature if the risk of
developing clinical depression can be cut 
by only 50%. But such rates of reduction are 
comparable to those for reducing the risk of 
influenza through vaccination (40-60%; see 
go.nature.com/2wjkr93). 

Few online and smartphone apps for mental 
health have been rigorously tested. And some 
in the field might be concerned that MOOIs 
could be useless — or worse, harmful. All MOOIs 
would need to be evidence-based. 

Another potential concern is that, once 
MOOIs become available, insurance companies
could refuse to reimburse people for
in-person sessions with counsellors. And
there is the worry that MOOIs might exacer- 
bate inequities, with wealthy people receiving 
cognitive behavioural therapy from therapists 
and poorer people having access only to online 
resources. But risks must be weighed against 
benefits for every intervention. 


Thinking even bigger

So what about the other 50%, whose depression
is harder to head off? 

As all three NASEM reports point out,
genetics, other biological factors — such as virus
infections or hormone disorders — and people’s 
social and physical environments interact to 
have a major impact on mental health. 

Epigenetics, the study of heritable altera- 
tions of the genome structure that are environ- 
mentally induced and don’t involve changes 
to the DNA sequence itself, is revealing how 
life events affect gene expression and the 
development of mental disorders. Much of 
the variation in DNA methylation that occurs 
during the first month of an infant’s life, as well 
as their weight at birth and even some child- 
hood behaviours, have been associated with 
prenatal environmental factors, such as the 
mother’s smoking habits, mental health and 
body weight.

The impact of the social environment on 
development has also been documented in 
detail. There is growing evidence that nurtur- 
ing environments (achieved by rewarding good 
performance at school rather than punishing 
poor performance, for instance) have a major 
impact on children’s healthy development.

To reach the other 50%, we need to expand 
beyond individually focused interventions. The
2019 NASEM report recommends that clinical
researchers, health-care providers and policymakers
should systematically study and implement
more-ambitious interventions that focus
on children’s social and physical environments.

Soon after arriving at the University of Oregon
in Eugene in 1972 to start my doctorate in clinical
psychology, I attended a talk at the local
community mental-health centre. The speaker
chided the professionals in the room, saying, in
essence, “We therapists sit in our offices waiting 
for people to suffer enough to come to see us, 
or to be brought in by their family or the police 
because they are being disruptive. We should be 
going out into the community and sharing what 
we have learnt so that people can prevent the 
mental, emotional and behavioural problems 
that bring them to our offices.” 

That evening, I decided to devote much of 
my professional work to the prevention of 
mental disorders. 

Forty-seven years later, we have the knowl- 
edge and the tools to create a world in which 
fewer people ever experience clinical depres- 
sion and other mental disorders. Let’s start 
creating it.


Readers respond


Correspondence 


Nobels, gender and 
ethnicity 


As secretary-general of the Royal 
Swedish Academy of Sciences 
and secretary of the Nobel 
Committee for Chemistry, we 
share your concerns about
the shortage of women and of
scientists from outside Europe 
and North America among 
Nobel laureates (see Nature 574, 
295; 2019). 

However, you are incorrect in 
saying that we invite only “elite 
universities and academies” to 
nominate candidates for the 
Nobel prize. We make substantial 
efforts to approach research 
universities across the world. 
Each year, we request lists of 
faculty members from about 
200 such universities, selected 
from some 1,600 institutions 
on a rotational scheme. We then
send individual nomination 
forms to each professor. 

The inequitable distribution 
of Nobel prizes is a symptom of a
bigger problem. Science has been 
dominated by Western Europe 
and North America for centuries, 
and women have had limited 
scientific opportunities. For 
example, fewer than 15% of senior 
authors in Nature are women 
(Y. A. Shen et al. Nature 555, 165;
2018) and just 2% of your authors 
are from Africa, South America or 
western Asia (Nature Index 2019). 

Our award process strives 
to ensure that all scientists get 
a fair chance, irrespective of 
geography or gender. As a small
contribution, we are launching a 
programme of Nobel Symposia 
in Africa. But others must also 
work to improve the situation — 
by encouraging women to 
pursue science careers and 
by supporting research in
low-income countries. 


Göran K. Hansson, Gunnar
von Heijne The Royal Swedish
Academy of Sciences, Stockholm, 
Sweden. 

goran.hansson@kva.se 


India — science and 
social responsibility 


India issued a draft national 
policy in September for social 
responsibility in science (see 
go.nature.com/32sihv2). 

Its aim is to strengthen the 
country’s knowledge ecosystem, 
improve communication 
between science and society, 
and translate research into 
social benefits. A central
agency and a national digital
portal will oversee the policy’s
implementation, which is
implementation, which is 
currently being widely discussed 
in the scientific community. 

This policy for scientific social 
responsibility (SSR) is founded 
on scientists’ ethical obligation
to give back to society in return 
for the taxpayers’ money that 
funds their research. It will 
promote scientific solutions 
for societal problems such 
as rural deprivation and the 
disempowerment of women, 
and improve scientific and 
technological support for 
industry. Funders will be 
expected to make SSR a
condition for awarding grants. 

India’s scientific community is 
crucial for the implementation of 
the new policy. Researchers will 
be required to spend a minimum 
of 10 days every year in public 
engagement, and to share their
knowledge, resources, data sets 
and equipment to accelerate the 
advancement of science. Credit 
for SSR efforts will be given to 
researchers in their performance 
evaluations. 

Once this policy takes effect, 
India could lead the way in 
making science and scientists 
worldwide more socially 
responsible. 


Abhay S. D. Rajput Indian Institute 
of Tropical Meteorology, Pashan, 
India. 

abhaysdr@tropmet.res.in


Chile — right to free
will needs definition 


Chile could soon become the 
first country to incorporate 
‘neurorights’ into its constitution 
to prevent the misuse of 
artificial intelligence and 
neurotechnology (see
go.nature.com/35qdje5). These rights
relate to the way in which mental 
processes can be monitored and 
influenced, principally through 
brain–computer interfaces.

The country’s Senate has 
launched a project designed 
to protect five neurorights 
proposed by Rafael Yuste, a 
neurobiologist at Columbia 
University in New York City 
(see go.nature.com/33trrmc 
and R. Yuste et al. Nature 551, 
159-163; 2017). These include 
the right to ‘free will’. However, 
the precise meaning of free will 
should be carefully debated 
before it is incorporated as a
national or international right. 

Free will is a multidimensional
concept that poses several 
unsolved philosophical 
problems. Most prominent is 
whether free will is compatible 
with determinism. Cultural 
diversity can also influence its 
interpretation (see N. Chernyak 
et al. Dev. Psychol. 55, 866-876;
2019). Furthermore, different 
types of free choice and action 
have already been included in the 
Universal Declaration of Human 
Rights (see articles 16.2, 18 and 
21.3; go.nature.com/33t9bhn). 

We should therefore 
work towards developing a 
consensual, minimal definition 
of free will. Although such a
definition might not close the 
philosophical debate, it could 
be ethically operational in 
that it would help to pre-empt 
misinterpretations based on 
legislative loopholes. 


José M. Muñoz European
University of Valencia, Spain.
josemanuel.munoz@universidadeuropea.es


Test reproducibility
of old computer code 


We question whether analytical 
tools such as Common Workflow 
Language, which aim to make 
computational methods 
“reproducible and shareable”,
can stand the test of time (see 
Nature 573, 149-150; 2019). 
The long-term validity of 
computational results will not 
be testable if the original code 
cannot be run many years later. 

Considering the rapidity of 
transformations in operating 
systems and programming 
languages, it is hard to predict 
the lifetime reproducibility 
of a particular code. We have 
therefore organized the Ten 
Years Reproducibility Challenge 
(see go.nature.com/2bwcukq). 
Researchers are invited to test 
code reproducibility by trying 
to rerun a code created for a
scientific paper they published 
more than ten years ago. The 
codes can address any scientific 
domain (statistical analysis, 
numerical simulation or data 
processing, for example) and be 
written in any language. 

The challenge closes in April 
2020. Our hope is that the results 
will offer insights into long-term 
causes of non-reproducibility. 


Konrad Hinsen Centre for 
Molecular Biophysics, CNRS, 
Orléans, France.


Cell biology


Senescent cells feed 
on their neighbours 


Michael Overholtzer 


Chemotherapy-treated cancer cells that enter a non-dividing
state called senescence can nevertheless boost cancer growth. 
The finding that these cells eat neighbouring cells reveals a 
mechanism that enables senescent cells to persist. 


Multicellular life requires individual cells to 
cooperate in a way that benefits the organism.
Cells that are uncooperative because they are 
damaged or dysfunctional, and that pose a 
threat, are either eliminated by cell death or 
undergo a usually irreversible growth arrest 
called senescence’. Senescent cells typically 
never divide (although there are some rare 
examples of cells exiting senescence and 
resuming division), but they can persist in 
tissues and contribute to ageing and cancer 
progression””. Writing in the Journal of Cell
Biology, Tonnessen-Murray et al.* reveal a
deadly activity that underlies the persis- 
tence of senescent cells — they can eat their 
neighbours alive. 

Cellular entry into senescence benefits an
organism because it inhibits cancer development
by preventing the division of cells
that have accumulated extensive DNA damage
or that express cancer-promoting genes
called oncogenes’. Senescent cells are
metabolically active®, and this is characterized
by their secretion of pro-inflammatory
molecules as part of a phenomenon termed
the senescence-associated secretory phenotype
(SASP) response’. Senescent cells can
promote cancer progression and resistance
to anticancer therapy in some contexts, as a
result of the secretion, through SASP, of growth
factors and immune-signalling molecules
called cytokines’.

Figure 1 | Cellular cannibalism. Chemotherapy drugs can cause cancer cells to enter a state of senescence,
which is usually associated with an irreversible halt to cell division. However, these cells can promote tumour
survival by secreting growth factors and signalling molecules called cytokines in a process termed the
senescence-associated secretory phenotype (SASP) response’. Tonnessen-Murray et al.‘ report studies of
breast cancer in mice which reveal that this type of senescent cell takes up (engulfs) and digests neighbouring
living cells. The cells are engulfed by a process that has molecular characteristics of phagocytosis, an
engulfment process that immune cells use. Once ingested, the cells are enveloped in membrane from an
organelle called the lysosome and digested. This might account for a portion of the numerous lysosomes that
are a hallmark of senescent cells. This degradation provides metabolic building blocks for the cell. Senescent
cells that have ingested their neighbours survive for longer than senescent cells that have not.

Chemotherapy that damages the DNA of 
cancer cells can result in their death or their 
entry into senescence. Tonnessen-Murray 
and colleagues investigated the effects of 
chemotherapy-driven senescence in breast 
cancer cells in mice treated with the chem- 
otherapeutic drug doxorubicin. Under the 
microscope, they saw senescent cells eating 
and digesting entire neighbouring cells (Fig. 1). 
This striking observation was made in breast 
tumours formed of mixtures of transplanted 
cancer cells, which were engineered to express 
red or green fluorescent proteins. It can be dif- 
ficult to observe a cell being internalized by 
another cell (a process termed engulfment) in 
cancer tissues. By growing tumours with mix- 
tures of fluorescently labelled cells, the authors 
could clearly identify red- or green-labelled 
cells being taken up into neighbouring cells 
labelled by the other colour. 

Engulfment also occurred at high rates for 
mouse and human breast cancer cells grown 
in vitro and treated with doxorubicin or another
chemotherapeutic drug, paclitaxel. Ingestion 
peaked at 4-6 days after drug treatment, a 
time that correlated with the induction of 
senescence. The cells that were engulfed by the 
senescent cells were neighbouring senescent 
or non-senescent cancer cells. They showed no
sign of being dead, and engulfment occurred 
even in the presence of a cell-death inhibitor 
molecule. This led the authors to conclude that 
the ingested cells were being eaten alive. 

Ingested cells are broken down in a digestive
organelle called the lysosome. Crucially, senes- 
cent cells that ate their neighbours survived 
longer in vitro than those that did not. This 
finding suggests that metabolic building 
blocks retrieved from the lysosomal diges- 
tion of neighbouring cells were being used by 
senescent cells to promote their survival. 

This surprising finding that cell death sup- 
ports the survival of senescent cells highlights 
the complexity of cell-death regulation in 
multicellular animals. Numerous mechanisms 
of cell death occur in animal tissues. These 
include forms of cell suicide, such as apoptosis, 
which leads to the fragmentation of individual 
cells, and regulated forms of necrotic cell death 
that induce cell rupture’. Some cell deaths are 
also carried out as ‘murders’”*. These typically
involve the presence of engulfing cells, and
occur by at least two distinct mechanisms’. 

One is a form of cell death called entosis, in 
which living cells that are destined to die invade 
a neighbouring cell and become engulfed”. 
Another mechanism is cellular cannibalism, in 
which living cells that will be ingested are tar- 
geted by a type of engulfment that resembles 
phagocytosis — the process typically used by 
immune-system cells such as macrophages to 
ingest and destroy dying cells’. Such cellular 
murders can support the survival of particu- 
lar cells in a population that benefit from the 
metabolic banquet derived from ingesting and 
degrading whole cells”. 

The authors examined the mechanism of 
senescence-associated engulfment and found 
that, although entosis could occur in the type 
of tumour cell studied, the engulfment of 
senescent cells did not involve the proteins 
required for entosis’°. The authors analysed the 
gene-expression profile of cancer cells treated 
with chemotherapy drugs (most of these cells 
were senescent), and found that genes charac- 
teristic of phagocytosis were expressed. This 
gene expression peaked within a timeframe 
that correlated with the cellular engulfment. 
Senescent cells were also observed to engulf 
dead cells added in vitro, providing further evi- 
dence for the authors’ model that senescent 
cells engulf cells by phagocytosis. 

Cell cannibalism in cancers has been 
reported previously’””. However,
Tonnessen-Murray et al. specifically identify an association
between cannibalism and senescence, and 
show that this phenomenon might make a 
substantial contribution to the persistence of 
senescent cells in cancer tissues. The authors 
observed that cannibalism by senescent breast 
cancer cells occurs irrespective of whether 
or not the cell has functional p53, a notable 
tumour-suppressor protein that can control 
entry into senescence”. The authors tested 
chemotherapy-induced senescent cells of other 
types of cancer, including lung cancer and a
bone cancer called osteosarcoma, and found 
that these cells also cannibalize neighbouring 
cells. Together, these findings suggest that cell 
cannibalism might be an activity that is broadly 
associated with the induction of senescence, 
rather than being linked to particular types 
of cancer or to the status of proteins such as 
p53. It will be important to investigate whether 
cannibalism is linked to senescence in other 
contexts, for example during tissue development
when senescence can occur", or in aged
tissues that accumulate senescent cells*. 

Entosis in cancer-cell populations can 
promote competition between individual cells 
in which ‘winner’ cells ingest and kill neigh- 
bouring ‘loser’ cells, removing them from the 
population’®. Whether cells behave as winners 
or losers depends on certain cellular character- 
istics, for example differences in the tension 
of the internal cellular framework called the
cytoskeleton”. It would be interesting to
investigate whether senescent cells choose
particular target cells to cannibalize in a
competitive fashion. In cancers, complex mixtures
of cells coexist in the tumour microenviron- 
ment, and this cellular composition changes 
over time or in response to anticancer therapy.
The authors propose that cell cannibalism
might affect cancer progression by supporting
the SASP response. However, it is worth considering
whether it might also contribute directly
to cancer progression by removing particular 
cells from the tumour microenvironment. 
And if normal cells are found to be removed 
by senescent cells in aged tissues, this deple- 
tion might contribute directly to tissue 
degeneration. 


Michael Overholtzer is in the Cell Biology 
Program, Memorial Sloan Kettering Cancer
Center, New York, New York 10065, USA.

Fresh evidence in the
glacial-cycle debate


Eric W. Wolff 


An analysis of air up to 2 million years old, trapped in Antarctic 
ice, shows that a major shift in the periodicity of glacial cycles 
was probably not caused by a long-term decline in atmospheric
levels of carbon dioxide. See p.663


During the past 2.6 million years, Earth’s climate 
has alternated between warm periods known 
as interglacials, when conditions were similar 
to those of today, and cold glacials, when ice 
sheets spread across North America and north- 
ern Europe. Before about 1 million years ago, 
the warm periods recurred every 40,000 years, 
but after that, the return period lengthened to 
an average of about 100,000 years. It has often 
been suggested that a decline in the atmos- 
pheric concentration of carbon dioxide was 
responsible for this fundamental change. On 
page 663, Yan et al.¹ report the first direct measurements
of atmospheric CO₂ concentrations
from more than 1 million years ago. Their data
show that, although CO₂ levels during glacials
stayed well above the lows that occurred during
the deep glacials of the past 800,000 years, the
maximum CO₂ concentrations during interglacials
did not decline. The explanation for the
change must therefore lie elsewhere. 
Understanding what caused the shift in 
periodicity, known as the mid-Pleistocene transition
(MPT), is one of the great challenges of
palaeoclimate science. The 40,000-year periodicity
that dominated until about 1 million
years ago is easily explained, because the tilt
of Earth’s spin axis relative to its orbit around
the Sun varies between 22.1° and 24.5° with the
same period. In other words, before the MPT,
low tilts led to cooler summers that promoted
the growth and preservation of ice sheets. 

But after the MPT, glacial cycles lasted for 
two to three tilt cycles. Because the pattern 
of variation in Earth’s orbit and tilt remained 
unchanged, this implies that the energy needed 
to lose ice sheets” had increased. One prom- 
inent explanation’ is that atmospheric levels 
of CO₂ were declining, and eventually crossed
a threshold value below which the net cool- 
ing effect of the decline allowed ice sheets to 
persist and grow larger. 
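
The arithmetic behind "two to three tilt cycles" can be sketched in a few lines. This is only an illustration: the 40,000-year tilt period comes from the text, whereas the function name and the equal weighting of two-cycle and three-cycle glacials are assumptions made here for the example, not a result from the study.

```python
# Illustrative arithmetic only. The 40,000-year tilt period and the
# "two to three tilt cycles" figure are taken from the article text.
TILT_PERIOD_YEARS = 40_000


def glacial_cycle_length(tilt_cycles: int, tilt_period: int = TILT_PERIOD_YEARS) -> int:
    """Length in years of a glacial cycle spanning a whole number of tilt cycles."""
    return tilt_cycles * tilt_period


# Before the MPT: one glacial cycle per tilt cycle.
pre_mpt = glacial_cycle_length(1)

# After the MPT: glacial cycles lasting two or three tilt cycles.
post_mpt = [glacial_cycle_length(n) for n in (2, 3)]

# Assumption: two-cycle and three-cycle glacials are weighted equally.
post_mpt_mean = sum(post_mpt) / len(post_mpt)

print(pre_mpt, post_mpt, post_mpt_mean)
```

Under that equal-weighting assumption, cycles of 80,000 and 120,000 years average to the roughly 100,000-year return period described for the interval after the MPT.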

Ancient air trapped in Antarctic ice can be 
extracted from cores drilled from the ice sheet, 
allowing the CO₂ concentration to be measured
directly, but the ice-core record extends
to only 800,000 years ago*. Estimates of CO₂
concentrations from earlier periods have
been made by measuring the ratio of boron
isotopes in shells found in ancient marine
sediments**. This proxy measurement depends
on a chemical equilibrium controlled by ocean
acidity, which, in turn, is closely related to the
atmospheric CO₂ concentration.




But the estimates of CO₂ levels inferred from
such measurements are necessarily imprecise 
and must be verified using more-precise, direct 
measurements. Scientists have therefore for- 
mulated plans’ to find and retrieve deep ice 
cores that reach back to before the MPT (see 
go.nature.com/33mw4yk). One project has 
recently been funded by the European Union, 
and hopes to retrieve million-year-old ice 
in 2024. 

Yan et al. tried another approach to finding 
similarly old ice, but nearer the surface of 
Antarctica. In regions known as blue-ice areas,
the combination of ice flow against a mountain
barrier and surface ice loss by wind scouring 
and sublimation (transformation of ice directly 
into water vapour) leads to upwelling of old 
ice towards the surface. The authors therefore 
studied two cores, 147 and 191 metres deep, that 
were drilled to bedrock in the blue-ice region 
near the Allan Hills in Antarctica (Fig. 1). 

The researchers improved and applied a 
relatively new method* to date this old ice. 
The concentration of argon-40 in Earth’s 
atmosphere is slowly increasing with time as 
it is produced from the radioactive decay of 
potassium-40. By measuring the ratios of argon 
isotopes in air extracted from cores, the age of 
ice can be determined. The authors also meas- 
ured the ratios of deuterium (a heavy isotope 
of hydrogen) to hydrogen in the ice, which can
be used as a proxy of temperature at the time 
the ice was deposited. 

Yan and colleagues concluded that ice in 
the lowest 30 m of each core is up to 2.7 mil- 
lion years old. However, the uncertainty of 
100,000 years in this dating precludes their 
samples from being matched to particular 
parts of Earth’s tilt cycle. Moreover, the authors 
found abrupt age discontinuities with depth 
in the cores, which suggests that the layers 
of ice within them have been disturbed. The 
authors therefore treated the measured concentrations
of deuterium and CO₂ as snapshots
of climate and atmospheric composition that
corresponded to an approximate age of the
ice, rather than as an ordered time series. On
the basis of the deuterium values, they make a
plausible case that the observed range of measured
CO₂ values represents most of the actual
glacial–interglacial range.

Unfortunately, in the oldest ice samples, 
there was evidence that the CO₂ concentration
had been artificially enhanced by gas produced
from the breakdown of organic material at
the base of the ice sheet. A few samples from
about 2 million years ago were potentially not
affected by this issue, but were insufficient in
number to allow any conclusions to be drawn
about the range of CO₂ levels at that time.

However, Yan et al. obtained samples 
from about 1 million and 1.5 million years 
ago that they consider to be undisturbed by 
the artificial addition of CO₂. In both periods,
the maximum CO₂ concentrations are
similar to those of interglacials from the past
500,000 years, peaking at 279 parts per million
(p.p.m.). But the minimum value of 214 p.p.m.
is much higher than the lows of around
180 p.p.m. that occurred during recent glacial
maxima (the periods that corresponded to the
maximum extent of ice).

Figure 1 | Blue ice near the Allan Hills region of Antarctica. The environmental conditions in this area draw
ancient ice to the surface. Yan et al.¹ have analysed air trapped in an ice core drilled from this region to obtain
the first direct measurements of atmospheric carbon dioxide levels from more than 1 million years ago.

The authors conclude that the relationship 
between CO₂ levels and Antarctic temperature
was similar before and after the MPT. The fact
that the pre-MPT ice does not contain very low
ratios of deuterium to hydrogen that would
be characteristic of extremely cold Antarctic
temperatures, nor low CO₂ levels characteristic
of recent glacial maxima, is probably just a
consequence of the shorter period of the gla- 
cial cycles. Such low values are generally not 
found in the first 40,000 years of post-MPT 
glacial cycles either. 

Although Yan and colleagues’ data points 
cannot be placed within a tilt cycle, it seems 
likely that the CO₂ concentrations are not very
different at the crucial points in cycles when 
the ice sheet is either lost (before the MPT) or 
continues growing (after the MPT). This forces
us to look elsewhere for the cause of the longer
cycles, perhaps refocusing efforts on under- 
standing whether changes to the nature of the 
ice-sheet bed caused by glacial erosion’ altered 
the characteristics of the ice sheets and their 
vulnerability to melting. 

Yan and colleagues’ data add much-needed 
precision to the previously reported estimates 
of CO₂ levels made using data from marine sediments>*.
However, their tantalizing snapshots
of the pre-MPT world emphasize the need for
a complete, undisturbed time series of greenhouse-gas
concentrations that can be put into
context with the climate cycles at that time.
Let us hope that the planned new ice cores will
provide that. 


Eric W. Wolff is in the Department of

Bacterial twist to an
antiviral defence 


Karen L. Maxwell 


The discovery of an antiviral defence system in bacteria that 
shares some components with a key antiviral defence pathway
in animals provides insight into how this important response
might have evolved. See p.691


Humans face a daily threat of infection by harm- 
ful viruses. To repel them, our immune system 
mounts an immediate response following invasion
that depends on its ability to recognize
general characteristics indicating that viruses 
are foreign entities. This type of reaction, gen- 
erated by an ancient branch of the immune 
system known as innate immunity, occurs in 
all plants and animals. Many genes involved 
in innate immune responses are evolutionarily
conserved and encode proteins that are used 
for defence purposes in different species’ ?. 
Cohen et al.* report on page 691 that some 
bacterial species fight viral infections by using 
an innate immune mechanism that is related 
to one of the central components of innate 
immunity in animals called the cGAS-STING 
pathway. Their findings reveal that this crucial 
antiviral defence system in animals might have 
its evolutionary roots in bacteria. 


There has been a rise in evidence indicating
that the defence systems mediating innate 
immunity in animals have counterparts in bac- 
teria. For example, protein components called 
TIR domains, which are present in defence pro- 
teins in mammals and plants, can recognize 
molecular hallmarks of disease-causing agents 
known as PAMPs, and then trigger an immune 
response. TIR domains are evolutionarily 
conserved in bacteria, protecting them from 
viruses called phages’. Another such example 
is the antiviral machinery that targets RNA and 
depends on proteins called Argonautes, found 
in plants and animals. This system also has a 
role in defence responses in bacteria and the 
single-celled organisms known as archaea®’. 

The evolutionary conservation of these 
innate immune mechanisms in bacteria and 
mammals suggests that such pathways might 
have first arisen in bacteria as protection
against phages, and have since evolved into
different, but related, defences across the tree
of life. One key open question is how many of
the innate immune defences found in animals
might have evolved from ancient bacterial
systems.

Figure 1 | A bacterial defence against viral infection. Cohen et al.* report a defence system that is used
by bacteria to fight infection by bacterium-infecting viruses called phages. After phage infection is sensed
(through an unknown mechanism), the enzyme cGAS is activated, producing the molecule cGAMP. Such
changes in cGAS activity and cGAMP levels in response to viral infection also occur in a range of multicellular
organisms, including humans. In the bacterial system, the rise in cGAMP can lead to activation of a
phospholipase enzyme that degrades phospholipid molecules in the bacterial cell membrane. This process
kills the bacterial cell and can stop viral infection from spreading to neighbouring cells.

When the cGAS-STING pathway®” in ani- 
mals detects invading viruses in a cell, it acti- 
vates a response that either mediates antiviral 
defences or triggers cell death’®. The cGAS 
enzyme functions in this defence by sensing 
and binding to double-stranded viral DNA, 
and then inducing" the production of a type 
of signalling molecule called cGAMP, which 
is termed a cyclic dinucleotide. The binding 
of cGAMP to the STING protein sets off a sig- 
nalling cascade that unleashes an antiviral 
response. 

Cohen and colleagues analysed regions of 
bacterial genomes in which defence genes are 
clustered, and noticed that the gene encod- 
ing cGAS was often located near genes whose 
products are involved in other antiphage defence
systems, such as CRISPR-Cas. The authors
therefore wondered whether cGAS might have
a role in antiphage defences.

To test this idea, Cohen et al. engineered 
bacteria lacking a cGAS system to express 
genes encoding such systems. The authors 
tested two representative cGAS systems 
(comprising the gene encoding cGAS and three 
adjacent genes) from the bacterial species 
Vibrio cholerae and Escherichia coli. Both cGAS 
systems conferred a resistance to infection by 
diverse phages. When the authors disrupted 
the DNA sequence of the cGAS-system genes, 
resistance to phage infection was completely 
lost — confirming that bacteria use this cGAS 
signalling pathway for antiviral defence, much 
as do eukaryotes (organisms that
have a nucleus in their cells). The authors called
this antiphage defence system cyclic-oligonu- 
cleotide-based anti-phage signalling system 
(CBASS). Genes encoding cGAS proteins are 
present in approximately 10% of all sequenced 
bacterial genomes”, suggesting that CBASS 
systems have a widespread role in antiphage 
defences. 

The pathway downstream of cGAMP production
in bacteria differs from that in animals. The
authors report that, in bacteria, cGAMP pro- 
duction activated a phospholipase enzyme in 
some CBASS systems (Fig. 1). This activated 
phospholipase then degraded phospholipid 
molecules in the bacterial cell membrane, 
killing the infected bacterium. Such cellular 
‘suicide’ could protect a bacterial cell popula- 
tion because the destruction of infected cells 
through this process prevents the phage from 
spreading to neighbouring bacteria. 

In some bacterial species, the CBASS 
defence systems identified by Cohen and col- 
leagues lacked a phospholipase component. 
These systems instead encoded proteins that 
might trigger cell suicide through alternative
mechanisms, such as degrading the bacterial
genome or creating a hole in the cell membrane
through the action of a pore-forming protein. But
whether these systems kill cells in such ways
remains to be tested. In some cases, the CBASS
systems encoded a protein in which a TIR domain
was fused to a STING domain similar to that in 
eukaryotes. The evolutionary conservation of 
these domains in an antiviral defence system in 
bacteria suggests that they might represent the 
ancient evolutionary origin of the eukaryotic 
cGAS-STING defence system. 

Although some CBASS systems had only 
cGAS genes and components required for 
bacterial cell death, others had genes whose 
products were associated with ubiquitination, 
a protein-modification pathway in eukaryotic
cells. In this process, a protein called ubiquitin
is attached to a target by an enzyme-mediated
reaction. CBASS systems included proteins
that have several components associated with
eukaryotic ubiquitination: E1 and E2 domains,
typically found in enzymes that mediate ubiq- 
uitin activation and transfer, respectively, and 
JAB domains, which are found in proteins that 
remove ubiquitin from targets. Ubiquitination 
fine-tunes the length and intensity of innate 
immune responses in animals”. This provides 
yet another link connecting bacterial and 
animal antiviral responses. The ubiquitina- 
tion components of the E. coli CBASS system 
were required for defence against some but 
not all phages, suggesting that these proteins 
might allow systems to recognize specific
phage proteins or features, rather than
a more general property of phages, thereby
refining the activity of these systems.

Antiphage defence systems in bacteria can 
be a target of phage-encoded inhibitor pro- 
teins. For example, phage proteins can block 
CRISPR-Cas defences”. It is highly probable 
that some phages have evolved ways to inhibit 
CBASS systems. Different CBASS systems 
encode a diverse set of cyclic-oligonucleo- 
tide signalling molecules and components, 
suggesting that cell suicide occurs through a 
number of mechanisms. The diversity of these 
CBASS-system components is probably driven 
by the need to evade a phage counter-attack if, 
for example, a phage-encoded protein could 
inactivate a particular cyclic-oligonucleotide 
signalling molecule. The selective pressure 
from antiphage systems that phages encounter 
would inevitably lead to the evolution of 
countermeasures in these viruses. An exciting 
area for future research will be to search for 
such phage inhibitors of CBASS systems. 

One key aspect of cGAS function in bacte- 
rial defence that remains unknown is which 
signal the immune system detects to rec- 
ognize that a viral infection is occurring. In 
eukaryotes, any viral double-stranded DNA 
in the cytoplasm can be recognized as a for- 
eign entity because eukaryotic DNA is usually 
confined to the nucleus and absent from the
cytoplasm. To distinguish cytoplasmic viral
DNA from bacterial DNA, a bacterium lacking
a nucleus would presumably require a sensor
with a nuanced capacity to identify foreign 
DNA. One possibility is that CBASS systems 
recognize phage DNA specifically in the linear, 
relaxed state that occurs immediately after it 
has entered the bacterial cell. Perhaps the pro- 
teins that have E1, E2 and JAB domains in CBASS 
systems provide further refinement to aid the 
success of this aspect of phage recognition. 

Cohen and colleagues’ study is particularly 
remarkable for highlighting the striking parallels 
between innate immunity in eukaryotes and bac- 
teria. The number of known bacterial antiphage 
systems is growing rapidly”, and it is proba- 
ble that many more such exciting connections 
remain to be uncovered.

Teamwork by T cells
boosts immunotherapy 


Jonathan L. Linehan & Lélia Delamarre 


Immunotherapy treatment harnesses CD8 T cells of the 
immune system to kill tumour cells. The finding that CD4 
helper T cells contribute to the success of this treatment in 
mice might offer a way to improve clinical outcomes. See p.696 


Immune cells called CD8 (or cytotoxic) T cells 
can target and kill cancer cells, and immuno- 
therapies that boost this process are in clinical 
use. However, for reasons that are not fully 
clear, it is hard to predict whether a person 
will respond to this treatment. On page 696, 
Alspach et al.¹ report mouse studies revealing
that another type of immune cell, called a CD4
cell (also known as a helper T cell), has a crucial
role in aiding CD8 T cells to target tumours 
after immunotherapy. 

Mutations in tumour cells can give rise to 
abnormal proteins, fragments of which — 
termed neoantigens — are displayed on the sur- 
face of cells bound to major histocompatibility 
complex (MHC) molecules. If a neoantigen is 
recognized by a CD8 T cell, this cell can target
and kill any tumour cells that express the neoantigen.
However, this cytotoxic response can be
blocked, for example by an immunosuppressive
environment surrounding a tumour. Immuno- 
therapy treatments called immune-checkpoint 
blockade or immune-checkpoint therapy can 
counteract such problems to enable CD8 T 
cells to unleash an effective immune response 
against the tumour.

Much immunotherapy research focuses on
CD8 T cells. However, there is emerging evidence
that CD4 T cells might have a key role in
tumour-targeting immune responses”*. 

Alspach and colleagues sought to iden- 
tify the minimal immune-stimulating neo- 
antigen requirement to drive an effective 
immune response in mice that were given 
an immunotherapy treatment. The authors 
studied mice that had a type of tumour to 
which the immune system does not nor- 
mally respond, and they engineered such 
tumours to express neoantigens. The neoan- 
tigen termed mLAMA4 is recognized by CD8 
T cells*, and the neoantigen termed mITGB1, 
recognized by CD4 T cells, was identified by
the authors using a computational prediction 
method. In the absence of immunotherapy, 
the expression of these two neoantigens, 
either alone or together in a tumour, was 
insufficient to trigger an effective immune 
response against the tumour. However, if 
both neoantigens were expressed in animals 
receiving immunotherapy, the tumour 
regressed. 

To check whether this response was simply
dependent on neoantigen quantity, rather than
the need for neoantigen recognition by both
types of immune cell, the authors engineered
mouse tumours to express two different neoantigens
that are recognized by CD8 T cells, but not
by CD4 cells. These tumours did not respond to
immunotherapy, demonstrating that a successful
immune response depends on the presence
of neoantigens that trigger responses from
both CD4 and CD8 T cells.

Figure 1 | T-cell collaboration drives an effective anti-tumour response to immunotherapy. Alspach et al.¹
studied how mice responded to immunotherapy that boosts tumour destruction by CD8 T cells. The mice
received transplanted tumours in each flank that expressed abnormal protein fragments called neoantigens.
The animals were then treated with immunotherapy. The neoantigen mLAMA4 is presented by type I major
histocompatibility complex (MHC) molecules and recognized by immune cells called CD8 T cells, and the
neoantigen mITGB1 is presented by type II MHC molecules and recognized by other immune cells called CD4
T cells. a, A tumour that expressed mLAMA4 attracted few immune cells and grew. b, By contrast, a tumour
on the animal's opposite flank that expressed mLAMA4 and mITGB1 regressed, indicating the importance of
activating both CD8 and CD4 T cells at the tumour site to generate a successful response to immunotherapy.
The robust immune response generated included the accumulation of CD4 and CD8 T cells (which produce the
signalling protein IFN-γ) and macrophage cells (which expressed the protein iNOS). The tumour cells lacked
type II MHC molecules; therefore, an as-yet-unidentified antigen-presenting cell (APC) probably presents
mITGB1 to CD4 T cells.

The authors’ analysis reveals that the CD4 
T cells that responded to mITGB1 had the hall- 
marks of a type of CD4 T cell called a T helper 
type 1 cell, which can increase the number and 
cell-killing activity of CD8 T cells”. The authors 
confirmed that, if tumours expressed both 
mLAMA4 and mITGB1, this indeed caused an
increase in the number and cytotoxic activity of
CD8 T cells, compared with the case for animals
with tumours that expressed only mLAMA4.
Alspach and colleagues also showed that, if animals
were first vaccinated with dying tumour
cells and were then implanted with a growing
tumour that expressed mLAMA4 and mITGB1,
the transplanted tumours were most efficiently
rejected if the vaccine contained tumour cells
that expressed both mLAMA4 and mITGB1 in
the same cell. 

To determine whether CD4 T cells have a role
beyond just enhancing the priming of CD8 T
cells, as occurs during vaccination, the authors
investigated whether mITGB1 is required at the
tumour site for an active immune response.
They implanted mice with a tumour that
expressed both mLAMA4 and mITGB1 on one
flank and with a tumour that expressed only
mLAMA4 on the opposite flank, and treated the
mice with immunotherapy (Fig. 1). As expected,
the tumour that expressed mLAMA4 and
mITGB1 was targeted by the immune system and
regressed, but the tumour that expressed only
mLAMA4 continued to grow slowly. In comparison
with the mLAMA4- and mITGB1-expressing
tumour, the growing tumour was infiltrated by 
fewer CD4 T cells, and by fewer CD8 T cells that 
could recognize mLAMA4. 

These results highlight the need for a
tumour to express neoantigens that are recognized
by both CD4 and CD8 T cells to generate
a productive response to immunotherapy.
Together, these data demonstrate that CD4 T 
cells not only aid the priming of CD8 T cells, 
but also collaborate with CD8 T cells at the 
tumour site to maintain an effective anti- 
tumour response during immunotherapy. 
The mechanism enabling this collaboration 
remains to be determined. 

The authors suggest that interferon-γ (IFN-γ), a type of immune-signalling protein called
a cytokine, might be one necessary component enabling this collaboration. IFN-γ is
produced by CD8 T cells and CD4 helper T cells, and can help to tackle an
immunosuppressive tumour environment?. Tumours that are responsive to immunotherapy
are associated with the presence of activated immune cells called macrophages that
express the protein iNOS (ref. 6). Alspach and colleagues observed that tumours that
expressed mLAMA4 and mITGB1 had an impressive 83-fold increase in the presence of
iNOS-expressing macrophages in comparison with tumours that expressed only mITGB1.
CD4 helper T cells alone are not sufficient to drive this iNOS expression in macrophages,
and CD8 T cells are also required, suggesting an interplay between these three types of
cell. Consistent with these findings, previous work’ indicates that macrophage activation
triggered by IFN-γ from CD4 T cells leads to the inhibition of tumour growth.

The tumour cells studied by the authors express type I MHC molecules that present
neoantigens to CD8 T cells, but they do not express type II MHC molecules that present
neoantigens to CD4 T cells. The identification of the immune cells that present
neoantigens, such as mITGB1, to CD4 T cells in this system should be a topic for future
research. Antigen-presenting immune cells, such as macrophages or dendritic cells, that
capture material from dead tumour cells and present it on type II MHC molecules are
probably involved. Indeed, dendritic cells are required® for the maintenance of an
immunotherapy response in an IFN-γ-dependent manner.

Future studies could investigate whether 
immunotherapies that target both CD4 and 
CD8 T cells should be developed for clini- 
cal use. An obstacle to understanding and 
harnessing the responses of CD4 T cells for 
immunotherapy has long been the difficulty 
in identifying neoantigens that trigger such 
responses, as well as the need for adequate 
tools to monitor these responses. Alspach and 
colleagues’ work, along with that of others””®, 
suggests that this is now changing.

Robust evidence of insect declines


William E. Kunin 


Data are mounting that document widespread insect losses.
A long-term research project now provides the strongest
evidence of this so far, and demonstrates the value of
standardized monitoring programmes. See p.671


There are certain times in life (whether in our relationships, personal health or
scientific research) when we think that we know something but the evidence is less than
conclusive. An accumulation of clues or symptoms might suggest a particular
interpretation without being strong enough to clinch the argument. In such situations, it
can be a relief to finally get a definitive answer, even if the news is bad. Once we know
that a problem definitely exists, we may be able to do something about it. Readers might
feel the same way when they read the results reported on page 671 by Seibold et al.¹,
which provide compelling evidence of a major problem: large-scale declines in the
numbers and diversity of insects and other jointed invertebrates known as arthropods.

Insects have pivotal roles in terrestrial ecosystems. These organisms dominate global
animal biodiversity in terms of their biomass, species numbers and total population
numbers, and they perform important ecosystem functions and services such as pollinating
flowers, disposing of dead organisms and waste, and forming crucial links in food webs.
Insect declines have been implicated as possible drivers of declines in insect-eating
birds’ and in animal-pollinated plants’. Thus, massive losses in insect diversity (Fig. 1)
and abundance would be grounds for serious concern.

The rumours of such declines have been
around for some time. In the 1990s, researchers
warned that extinctions among insects proba- 
bly outstripped those of more highly studied 
organisms such as vertebrates and plants*. 
Evidence subsequently grew of declines in 
particular insect groups, including butterflies> 
and bees’. Most such studies have focused 
narrowly on particular insect orders or families, 
although a recent global meta-analysis° pro- 
vides strong indications that insect losses are 
geographically widespread and occur across a 
range of taxonomic groups. 

However, much of this evidence has come 
from biodiversity databases — records of spe- 
cies sightings, mostly collected by volunteers, 
and usually gathered in a haphazard fashion. 


Analytical techniques can use such records to 
assess changes in the local species richness? or 
species’ distributions’, but analyses of this sort 
are frustratingly indirect. The models used to 
analyse data can attempt to take into account 
the often strong temporal and spatial biases 
in these data sets. However, such results could 
still be influenced by changes in the nature of 
the biodiversity recording over time, fuelled by 
changes in observers’ goals and methods. Such 
analyses are also limited in scope. Although 
they can reveal changes in diversity, such data, 
recording a glimpse of a species at a given site 
on a particular date, do not reveal shifts in 
insect abundance, which is arguably the most 
crucial aspect to assess when monitoring 
ecosystem services®. 

Standardized sampling can fill that gap. A previous study’ reported the data collected
from a network of standardized insect traps set by amateur entomologists in German
nature reserves over a 27-year period. That study indicated that the biomass of insects
captured declined by 75% over the period studied. This research raised serious concerns,
but it had limitations: the sampling of sites was opportunistic and not always consistent
over time, and although the biomass of the specimens caught was recorded, the species
were not identified or even counted, meaning that species richness and abundance
couldn't be assessed.

Seibold and colleagues finally complete the 
circle by reporting species richness, abundance 
and biomass for a wide range of arthropod taxa 
recorded using standardized sampling’. They 
describe the results of monitoring over nearly 
ten years of intensive study in grasslands and 
woodlands in three regions of Germany as part 
of the country’s interdisciplinary Biodiversity 
Exploratories project”. 

The results show clear evidence of 
substantial declines in arthropod abundance 
and biodiversity (Fig. 2). Grasslands were 
particularly badly affected: species richness 
of arthropods fell by 34% over the monitor- 
ing period, and the arthropod biomass and 
numbers recorded dropped by 67% and 78%, 
respectively. These declines were particularly 
strong in landscapes dominated by farmland, 
suggesting that agricultural management 
could be driving this drop. The losses among 
forest-dwelling arthropods were less pre- 
cipitous by comparison, with a 36% drop in 
species richness, a 41% loss of biomass and no 
statistically significant population decline. 
The verdict is clear. In Germany at least, insect
declines are real, and they’re every bit as severe 
as had been feared. 
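
To put the quoted percentages on a common footing, the following is a minimal sketch that converts each total decline into the constant yearly loss that would compound to it. It assumes a constant proportional rate, which the studies do not claim, and it takes the Seibold monitoring period as 10 years (the text says "nearly ten years"); the function name `annual_loss_rate` is illustrative only.

```python
def annual_loss_rate(total_decline: float, years: float) -> float:
    """Constant yearly fractional loss that compounds to total_decline over years."""
    remaining = 1.0 - total_decline
    return 1.0 - remaining ** (1.0 / years)


# Declines quoted in the text; the 10-year period is an assumption ("nearly ten years").
reported = [
    ("grassland species richness", 0.34, 10),
    ("grassland biomass", 0.67, 10),
    ("grassland abundance", 0.78, 10),
    ("forest species richness", 0.36, 10),
    ("forest biomass", 0.41, 10),
    ("earlier trap study, biomass", 0.75, 27),
]

for label, decline, years in reported:
    rate = annual_loss_rate(decline, years)
    print(f"{label}: {100 * rate:.1f}% per year over {years} years")
```

Under these assumptions, the grassland abundance loss corresponds to roughly 14% per year, compared with about 5% per year for the 27-year biomass record.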

Such long-term standardized monitoring as carried out by Seibold and colleagues is not
cheap. However, the expense is dwarfed by the expenditure needed to address the problem.
Agri-environmental programmes in the European Union, for example, spend tens of billions
of euros to encourage farmland biodiversity, a substantial portion of which is aimed
specifically or partially at insects (see go.nature.com/35pvdtv). Similarly, EU
restrictions on the use of neonicotinoid insecticides were instituted specifically to
protect insect pollinators, and (industry-funded) research" suggests that this
restriction has cost EU oilseed-rape farmers more than €500 million (US$549 million)
annually, because of reduced production and the consequential increased costs.

Figure 1 | The meadow plant bug, Leptopterna dolabrata. Rumours of a decline in the numbers of insects
such as the meadow plant bug are a cause for concern.

Figure 2 | Arthropod declines recorded in Germany. Seibold et al.¹ report nearly a decade’s worth of
standardized sampling of arthropods (jointed invertebrates such as insects) at grassland and forest sites.
The authors provide compelling evidence of declines in arthropod populations over time. These large-scale
changes will probably have a negative effect on key ecosystem services such as pollination. Seibold and
colleagues found that species richness and biomass declined significantly during the course of the study in
both types of habitat, as did the number of arthropod individuals recorded at grassland sites. The decline in
arthropod population sizes noted at forest sites was not statistically significant.

Ignorance is expensive. Given that substantial investments of public and private funds
for insect conservation are deemed appropriate by society, it is surely sensible to spend
a tiny proportion of such funds on monitoring, allowing us to assess the effectiveness of
these actions and to adjust them if necessary.

The Biodiversity Exploratories project would be a good model for such an effort. This is
because it extends beyond population monitoring to provide a platform for
cross-disciplinary science to address the large-scale and long-term issues that are
crucial in driving declines of insect communities, but which aren’t amenable to analysis
by controlled experimentation. The project not only helps to document declines in insect
populations and biodiversity, but also assists with diagnosing their potential causes. If
such in-depth, landscape-scale field research and monitoring were rolled out more widely
across Europe and beyond, we could begin to build land-use and agricultural policies on
the basis of compelling scientific evidence. The results reported by Seibold and
colleagues might not be good news, but at least now we know where we stand and what we
should start to do.


William E. Kunin is in the School of Biology, 
University of Leeds, Leeds LS2 9JT, UK. 
e-mail: w.e.kunin@leeds.ac.uk

Ninety per cent of baryons are located outside galaxies, either in the circumgalactic or
intergalactic medium’”. Theory points to galactic winds as the primary source of the
enriched and massive circumgalactic medium? °. Winds from compact starbursts have
been observed to flow to distances somewhat greater than ten kiloparsecs’ '°, but the
circumgalactic medium typically extends beyond a hundred kiloparsecs**. Here we
report optical integral field observations of the massive but compact galaxy SDSS
J211824.06+001729.4. The oxygen [O II] lines at wavelengths of 3726 and 3729
angstroms reveal an ionized outflow spanning 80 by 100 square kiloparsecs,
depositing metal-enriched gas at 10,000 kelvin through an hourglass-shaped nebula
that resembles an evacuated and limb-brightened bipolar bubble. We also observe
neutral gas phases at temperatures of less than 10,000 kelvin reaching distances of 20
kiloparsecs and velocities of around 1,500 kilometres per second. This multi-phase
outflow is probably driven by bursts of star formation, consistent with theory11,12.


The galaxy SDSS J211824.06+001729.4 that we study here, which we call Makani (Hawaiian
for ‘wind’), is an example of a merger of two galaxies hosting a galactic wind thought to
be powered by extreme star-formation surface density”. At redshift z = 0.459, Makani is a
compact but massive galaxy, with log(M*/M⊙) = 11.1 (±0.2), where M* and M⊙ are the
stellar and solar masses, respectively (Extended Data Fig. 4). Our Hubble Space Telescope
imaging analysis reveals a highly peaked stellar core (radius 400 pc) framed by two tidal
tails of 10–15 kpc so that half of the galaxy’s light extends to about 2.5 kpc (ref. “;
Fig. 1). Its stellar populations include old (more than a billion years, Gyr),
medium-aged (0.4 Gyr), and young (less than 7 million years, Myr) components (Extended
Data Fig. 5), with a current star-formation rate of 100–200 M⊙ yr⁻¹. It may contain a
dust-obscured accreting supermassive black hole, or active galactic nucleus (AGN), on the
basis of its X-ray luminosity of log L(2–10 keV) = 42.5'04 erg s⁻¹ (ref."), its
mid-infrared slope, and the presence of highly ionized gas such as [Ne V] at wavelength
λ = 3,426 Å (log L = 40.6°0 erg s⁻¹). However, any AGN is not currently energetically
dominant” or radio-loud (G. C. Petter et al., manuscript in preparation) and the data
could be explained by star formation and shocks. Extremely high-density star formation,
like that found in Makani, is capable of powering fast winds, independent of an AGN”.

To study the spatial extent of its outflow, we observed Makani with the Keck Cosmic Web
Imager (KCWI)”. The emission from the [O II] lines at λ = 3,726 Å and 3,729 Å in these
data reveals a nebula with approximate mirror symmetry around the north-south and
east-west axes, extending to radii of 50 kpc north and south of the galaxy nucleus, and
40 kpc east and west of it (Fig. 1). Its morphology resembles that of a limb-brightened,
bipolar bubble, similar to those seen in other galactic winds’°” but on a much larger
scale. This nebula is remarkable in the context of other [O II] emitters. Its area of
4,900 kpc² (136 arcsec² above a 5σ surface brightness limit per spaxel (spectral pixel)
of 5 × 10⁻¹⁸ erg s⁻¹ cm⁻² arcsec⁻²) makes it the largest [O II] nebula detected around a
single galaxy in the field’®” or in galaxy groups”°”. Its [O II] luminosity of
3.3 × 10⁴² erg s⁻¹ is several times the break in the galaxy luminosity function, L*, at
z ~ 0.45 (ref. ”). Its rest-frame equivalent width (40 Å), half-light radius (17 kpc),
and maximum radial extent (50 kpc) put it at the top end of [O II] emitters at z < 0.6
and radio-galaxy nebulae’®”, perhaps indicative of the unusual nature of this nebula as a
giant galactic wind.
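
The two area figures quoted above (4,900 kpc² and 136 arcsec²) imply a physical scale of about 6 kpc per arcsecond at z = 0.459. The sketch below reproduces that scale from a flat ΛCDM model. The parameter values (H0 = 67.4 km s⁻¹ Mpc⁻¹, Ωm = 0.315) are assumed here as representative Planck 2018 numbers; the text only says that the 2018 Planck cosmology is used, so the exact inputs may differ slightly.

```python
import math

C_KM_S = 299_792.458   # speed of light
H0 = 67.4              # km/s/Mpc, assumed Planck 2018-like value
OMEGA_M = 0.315        # assumed; flat universe, so Omega_Lambda = 1 - OMEGA_M


def comoving_distance_mpc(z: float, steps: int = 1000) -> float:
    """Line-of-sight comoving distance via Simpson's rule (steps must be even)."""
    def inv_e(zz: float) -> float:
        return 1.0 / math.sqrt(OMEGA_M * (1.0 + zz) ** 3 + (1.0 - OMEGA_M))

    h = z / steps
    total = inv_e(0.0) + inv_e(z)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * inv_e(i * h)
    return (C_KM_S / H0) * total * h / 3.0


def kpc_per_arcsec(z: float) -> float:
    """Proper transverse scale from the angular-diameter distance."""
    d_a_mpc = comoving_distance_mpc(z) / (1.0 + z)
    arcsec_in_rad = math.pi / (180.0 * 3600.0)
    return d_a_mpc * 1000.0 * arcsec_in_rad


scale = kpc_per_arcsec(0.459)
area_kpc2 = 136 * scale ** 2   # nebula area quoted as 136 arcsec^2
print(f"{scale:.2f} kpc per arcsec, area = {area_kpc2:.0f} kpc^2")
```

With these inputs the area comes out within about 1% of the quoted 4,900 kpc².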

Fig. 1 | The giant galactic wind surrounding the massive, compact galaxy Makani, observed by emission
from the [O II] line at λ = 3,726 Å and 3,729 Å. The colour scale and white contours show observed-frame
surface brightness, and the axes are labelled in kiloparsecs from the galaxy nucleus. Contours are 2–16% of
peak flux, spaced by factors of 2. A rest-frame V-band image of the galaxy (Hubble Space Telescope/WFC3
F814W filter) is superimposed on the centre of the [O II] image taken with KCWI at the Keck II telescope.
The small circle at the centre illustrates the radius of the compact core (400 pc). North is up and east is
to the left.

Fig. 2 | Velocity maps of the galactic wind. The velocities are from fits to Voronoi-binned [O II] data and
are calculated from the red side of the cumulative velocity distribution function to specified percentiles as
maximum blueshifted velocity vmax = v98% = v50% − 2σ (a) and velocity dispersion σ = (v84% − v16%)/2 (b).
In a, the blue colour denotes velocity. The insets show the deblended and stacked [O II] doublet versus
velocity; these highlight representative bins for each star-formation or outflow episode. The black line is
the continuum-subtracted spectrum, and the yellow or red filled profile is the emission line model. The
colour denotes whether the spaxel is part of episode I (yellow) or episode II (red). The black contours are
as in Fig. 1, and the white Hubble Space Telescope contour represents 1% of the peak stellar continuum flux,
accentuating the extended diffuse stellar emission from tidal forces in the merger. In b, the purple colour
denotes velocity dispersion and the yellow and red contours outline regions of specified vmax = v98% to
delineate episodes I and II. The spatial separation of the episodes is also reflected in the σ map. The
black contours in b outline the inner concentration of ionized gas at velocities −1,500 to −500 km s⁻¹
(Fig. 3d). We securely detect high-velocity [O II] emission beyond this inner region through Voronoi
binning.

Fig. 3 | Comparison of the ionized, neutral atomic and molecular phases of the galactic wind. The [O II]
map from Fig. 1 is replicated in a, with the north and east directions indicated in the lower right. In b, a
zoomed-in view of the inner 40 kpc, molecular gas in restored CO(2–1) (green contours) is plotted on top of
Mg II emission (colour, with white contours) in the same velocity range (−500 to +500 km s⁻¹). Panel c
compares the low-velocity molecular gas (restored CO, green contours) and ionized gas ([O II], colour and
white contours) over the same velocities (−300 to +300 km s⁻¹). The CO contours are 0.09, 0.14, 0.28, 0.5,
0.8 and 1.2 mJy per beam. Panel d is the high-velocity molecular gas (tapered, bottom right, contours of
0.08, 0.11, 0.122, 0.132 and 0.14 mJy per beam) on an [O II] channel map showing the brightest regions of
high-velocity ionized gas, both over −1,500 to −500 km s⁻¹. In each zoom box, the inset spectrum is a plot
of spatially integrated optical line flux versus velocity (−2,500 to +2,500 km s⁻¹), with the linemap
velocity range highlighted in orange.

On the basis of its light distribution alone, the hourglass shape of the nebula strongly
suggests a bipolar galactic wind emerging from its host galaxy. The spatially resolved
gas kinematics confirm this impression and separate the wind cleanly into an outer region
with low-velocity gas only and an inner region containing both low- and high-velocity gas
(Fig. 2). The outer region with lower velocities spans radii of 20–50 kpc, while the
high-velocity gas is concentrated within a radius of about 10 kpc. We call these two wind
components, and the associated starbursts that are thought to have produced them,
episodes I (0.4 Gyr ago) and II (7 Myr ago). We sort spaxels by the average maximum
blueshifted velocity of ⟨v98%⟩ = −700 km s⁻¹, although sorting by velocity dispersion σ
produces similar results. Episode I then has gas with maximum blueshifted velocities
between −100 and −700 km s⁻¹ while episode II has a high-velocity tail of gas to
v98% = −2,100 km s⁻¹. Timescale arguments further support the existence of two
starburst-driven wind episodes. In the 0.4 Gyr since the episode I starburst, a
constant-velocity wind must have travelled at 120 km s⁻¹ to reach the edge of the nebula
(50 kpc); representative velocities at the nebula outskirts are in fact 100–200 km s⁻¹.
In the 7 Myr since the most recent starburst episode began, the speed required to reach
the edge of the bulk of the episode II wind (10 kpc) is 1,400 km s⁻¹, which is also a
typical maximum velocity in the inner nebula. The lack of high-velocity redshifted gas in
episode II may be due to dust in the outflow blocking the far side of the wind, while the
lack of high-velocity gas in the episode I wind is probably due to the dispersal of
high-velocity gas to radii exceeding 50 kpc over 0.4 Gyr or the deceleration of the
outflow as predicted by models". Episode I gas also forms the telltale hourglass shape
and has higher typical velocity dispersions (200 km s⁻¹) than expected for tidal features
or gravitational motions at large radii”.

We find two other gas phases that we associate with the episode II (recent, inner)
outflow. Using the Atacama Large Millimeter Array (ALMA), we detect molecular gas traced
by CO(2–1) emission that is outflowing in a compact form, both blueshifted at −500 to
−1,500 km s⁻¹ and redshifted at 500–1,500 km s⁻¹, from the nucleus 10 kpc northward
(Fig. 3d). This gas is clearly part of episode II, given its high velocity and compact
scale. Lower-velocity molecular gas, at |v| < 500 km s⁻¹ and radius r < 20 kpc (Fig. 3c),
is also likely to be part of the outflow because it is much more extended than the
stellar disk and correlates spatially with extended, outflowing ionized gas. It may be
gas from episode II that has decelerated after reaching scales of the order of 10 kpc.
Using resonant line emission from Mg II at λ = 2,796 Å and 2,803 Å, we also detect
neutral gas of temperature T ~ 10⁴ K in the velocity range ±500 km s⁻¹ (Fig. 3b). This
emission correlates with some regions of faint, extended CO and [O II] emission. Although
these velocities are modest, resonant emission on 10-kpc scales has so far been detected
only in galactic winds, and because of strong radiation transfer effects such emission is
not highly shifted from the redshift of the Makani galaxy’s centre of mass”*”°.
Blueshifted Fe II absorption is detected in the nuclear spectrum (Extended Data Fig. 2),
but tracing its physical extent requires deeper observations.

Estimates of the mass contained within the wind would complete its portrait. However, the
mass of the ionized gas is uncertain without spatially resolved recombination line
measurements. Bootstrapping from single-aperture Hα and Hβ measurements, we estimate
6°$ × 10⁸ (200 cm⁻³/nₑ) M⊙ of ionized gas in the nebula, with unquantified systematic
errors (electron density nₑ and ionization state) probably exceeding the measurement
error. Although it is confined to the inner 10 kpc, the mass of the molecular gas in the
|v| = 500–1,500 km s⁻¹ flow (that is, episode II) is substantial (2.4°6 × 10⁹ M⊙), with
four times as much in the more extended ±500 km s⁻¹ component. The ionized gas plus
molecular wind thus contains 1–10% of the galaxy’s baryonic mass, a fraction that will be
even larger when all phases are accounted for. The resulting mass flow rate for the
molecular episode II component is dM/dt ~ Mv/r = 245 M⊙ yr⁻¹ for v = 1,000 km s⁻¹ and
r = 10 kpc, which is roughly one to two times the star-formation rate. This is consistent
with molecular outflow rates””° from other compact starburst mergers at z > 0.5.
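
The timescale argument and the mass-flow estimate are both simple ratios, and the sketch below works through them as a rough check. It assumes a constant wind speed and the thin-shell estimate dM/dt ~ Mv/r. The molecular mass of 2.4 × 10⁹ M⊙ is taken as the central value (its uncertainty is not used), and the function names are illustrative.

```python
KM_PER_KPC = 3.0857e16     # kilometres in one kiloparsec
SEC_PER_YR = 3.15576e7     # seconds in one Julian year


def constant_speed_km_s(distance_kpc: float, time_yr: float) -> float:
    """Speed needed to cover distance_kpc in time_yr at constant velocity."""
    return distance_kpc * KM_PER_KPC / (time_yr * SEC_PER_YR)


def mass_flow_msun_per_yr(mass_msun: float, speed_km_s: float, radius_kpc: float) -> float:
    """dM/dt ~ M v / r: the mass divided by the crossing time r / v."""
    crossing_time_yr = radius_kpc * KM_PER_KPC / speed_km_s / SEC_PER_YR
    return mass_msun / crossing_time_yr


episode_1 = constant_speed_km_s(50, 0.4e9)            # 50 kpc in 0.4 Gyr
episode_2 = constant_speed_km_s(10, 7e6)              # 10 kpc in 7 Myr
mdot = mass_flow_msun_per_yr(2.4e9, 1000, 10)         # molecular episode II flow

print(f"episode I:  {episode_1:.0f} km/s")
print(f"episode II: {episode_2:.0f} km/s")
print(f"mass flow:  {mdot:.0f} Msun/yr")
```

These reproduce the rounded values in the text (about 120 km s⁻¹, 1,400 km s⁻¹ and 245 M⊙ yr⁻¹).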
The huge, metal-enriched outflows in Makani, a key component of the host galaxy’s
dynamically and chemically evolving circumgalactic medium (CGM), are consistent with the
types of star-formation or AGN-driven winds that populate and enrich the CGM in
theoretical models”. A model galaxy forming stars at 100 M⊙ yr⁻¹ in a baryonic halo of
2 × 10" M⊙ and supernova-driven winds propelling gas with initial velocity 1,000 km s⁻¹
produce a 10° M⊙ (or 10°° M⊙) shell at r = 10 kpc (or 100 kpc) in t = 10 Myr (or
400 Myr), with velocities at this time of the order of 1,000 km s⁻¹ (100 km s⁻¹)". These
numbers bear a striking resemblance to the observations in the context of the two-episode
outflow we propose. The wind will continue to expand, diffuse, and virialize over longer
timescales, as this nebula is denser and more structured than virialized CGM gas*” and
has reached perhaps only about 10% of the virial radius” of a galaxy with
log(M*/M⊙) = 11.1 at z = 0.46.

The size of this wind makes it one of the largest wide-angle, galaxy-scale outflows yet
observed, with scales much larger than in other compact or high-z starbursts’””°. The
morphology and velocity of the wind in the Teacup AGN make it a cousin of Makani, but on
scales five to ten times smaller”. Although the Teacup also hosts diffuse gas over
100-kpc scales, this gas has a different physical origin than does the gas in Makani”®.
The most comparable system may be a merger with compact star formation at a cosmic
distance ten times closer than Makani. NGC 6240 (z = 0.04) has an ionized outflow that
reaches a 40-kpc radius®. However, the outflow size relative to the stellar half-light
radius (inside which half of the galaxy’s starlight resides) is only 1/4 in NGC 6240
(ref. ””), versus r/r_1/2 > 20 in Makani, and much of the NGC 6240 nebula coincides with
stellar tidal features, unlike Makani. Furthermore, the Hα luminosity of the NGC 6240
nebula is four times smaller than in Makani if the core Hα emission follows the oxygen
emission at a constant [O II]/Hα line ratio.

With a maximum extent of more than twenty times the stellar half-light radius, the oxygen
nebula observed here has propagated well into the galaxy halo, placing it solidly in the
CGM. The cool gas and metals in the flow are thus contributing to the buildup and
enrichment of the CGM. This cool gas can be propelled by hot gas, radiation pressure or
cosmic rays. The classic model of hot gas acceleration faces the problem that cold clouds
may be destroyed during acceleration. These clouds may simply reform after being shredded
and mixed in the hot wind”® or the clouds may survive the acceleration through fast
radiative cooling. The mixing layers in shredded clouds can also cool hot gas from the
halo or CGM, enhancing the amount of cool gas injected into the CGM”. If the cool outflow
is accelerated by a hot wind, the existence of [O II]-emitting gas at all radii argues
for either cloud reformation on very short timescales or for cloud survival (coupled with
enhancement from hot gas). The outflow we observe is thus feeding the CGM by directly
depositing gas from the galaxy or by entraining and cooling hot halo and circumgalactic
gas.

Connecting the CGM with ongoing galactic winds has been challeng- 
ing because of the lack of clear evidence for such winds on large enough 
scales. Previous evidence came from theory and the statistical char- 
acteristics of the CGM as measured from single quasar absorption lines 
over large galaxy populations**. We have now observed a single galaxy, 
with all lines of sight accounted for, whose wind has entered the CGM. 
Our measurement provides one of the first direct windows into the 
dynamically and chemically evolving, multiphase CGM being created 
around a massive galaxy.

Methods


KCWI observations and data analysis 

SDSS J211824.06+001729.4 was originally selected as an intermediate-redshift starburst
galaxy with broad but spatially unresolved line emission”, part of a population known to
host strong outflows™. We observed it with KCWI on the Keck II telescope on 6 November
2018 UT (Universal Time) for 40 min. We used the blue low-dispersion (BL) grating and
medium slicer with KCWI, yielding a resolution of 2.5 Å and wavelength coverage of
3,435–5,525 Å. We chose a central wavelength of 4,500 Å and detector binning of 2 × 2.
Conditions were photometric, with 0.6″ seeing. Two exposures were dithered 0.35″ along
slices to subsample the long spatial dimension of the output spaxels (0.69″ × 0.29″). The
field of view of the reduced data cube is 15.4″ × 19.4″.

We reduced the data using the KCWI data reduction pipeline and the IFSRED library®’. As
the default scattered light subtraction in the pipeline leaves visible residuals, we use
a routine (IFSR_KCWISCATSUB) to subtract scattered light by summing the data in 100-pixel
increments along columns (parallel to the dispersion direction) and fitting the
least-contaminated inter-slice regions along rows (parallel to the spatial direction)
with low-order polynomials. The default wavelength calibration also produces large
root-mean-square (r.m.s.) residuals because of a mismatch with the pipeline thorium-argon
(ThAr) atlas, so we extract a representative spectrum from our data and find its
wavelength solution using IDENTIFY in PyRAF
(www.stsci.edu/institute/software_hardware/pyraf). 16/20 lines in the 3,500–4,000 Å and
5,000–5,600 Å ranges are from Th; the other 38 lines (including most of the brightest
lines) are from Ar. The resulting r.m.s. residual is 0.18 Å. We then input this
calibrated spectrum as the atlas into the pipeline, yielding a 0.07 Å r.m.s. Following
the pipeline stages, we resample the data (IFSR_KCWIRESAMPLE) onto a 0.29″ × 0.29″ spaxel
grid; align the two exposures by fitting the galaxy centroid (IFSR_PEAK); and mosaic the
data (IFSR_MOSAIC). The resulting stacked and resampled field of view at 5,000 Å is
53 × 67 spaxels. The reconstructed KCWI continuum image (rest-frame near-ultraviolet) is
consistent with the Hubble Space Telescope WFC3/F814W image (rest-frame V) when convolved
with a 15-pixel Gaussian kernel to match the measured seeing. Finally, we sum the
nebula’s core emission in a 3.0″ circular aperture to match the Sloan Digital Sky Survey
(SDSS)** spectrum.
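
The resampled grid size follows directly from the field of view and the spaxel size, as the short sketch below shows. The physical scale is an assumption: it is inferred here from the nebula area quoted in the main text (4,900 kpc² for 136 arcsec²) rather than taken from a stated value.

```python
import math

SPAXEL_ARCSEC = 0.29                      # resampled square spaxel size
FOV_ARCSEC = (15.4, 19.4)                 # field of view of the reduced cube
KPC_PER_ARCSEC = math.sqrt(4900 / 136)    # assumed: inferred from the quoted areas


def spaxels_across(extent_arcsec: float, spaxel_arcsec: float = SPAXEL_ARCSEC) -> int:
    """Whole number of spaxels spanning an angular extent."""
    return round(extent_arcsec / spaxel_arcsec)


grid = tuple(spaxels_across(side) for side in FOV_ARCSEC)
fov_kpc = tuple(side * KPC_PER_ARCSEC for side in FOV_ARCSEC)
spaxel_kpc = SPAXEL_ARCSEC * KPC_PER_ARCSEC

print(f"grid: {grid[0]} x {grid[1]} spaxels")
print(f"field of view: {fov_kpc[0]:.0f} x {fov_kpc[1]:.0f} kpc")
print(f"one spaxel: {spaxel_kpc:.2f} kpc")
```

Under that assumed scale the field of view is large enough to contain the full extent of the nebula described in the main text.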

We created initial [O II] linemaps by integrating over [O II] and subtracting nearby
continuum windows on either side. The wavelength interval of each map is calculated from
the doublet average wavelength at a given velocity. We then used ±300 km s⁻¹ flux and
error maps to create Voronoi bins (where the velocity applies to the centroid of the
[O II] doublet; the doublet lines are 2.6 Å apart, which corresponds to 200 km s⁻¹). The
IDL (www.harrisgeospatial.com/Software-Technology/IDL) routine VORONOI_2D_BINNING*® is
used to construct the bins, with a target signal-to-noise ratio of 10 and a threshold
signal-to-noise ratio of 1. We fit the core spectrum, the full data cube, and the Voronoi
binned data cube with IFSFIT**. Because very few strong stellar lines arise in our
spectra (rest-frame 2,350–3,790 Å), we use a scaled continuum derived from the fit to the
rest-frame 2,550–5,600 Å spectrum" (see below). We fit two velocity components to the
[O II] and [Ne V] lines. If any component falls below 2σ in a spaxel, the spectrum is
re-fitted with fewer components. Allowing the [O II] line ratio to float freely in the
narrow component of the core spectrum results in an [O II] 3,729 Å/[O II] 3,726 Å ratio
of 1.2 (corresponding to nₑ = 200 cm⁻³), while the broad component ratio is
unconstrained. We thus fix the [O II] ratio to 1.2 in all fits. In the core spectrum,
Mg I 2,852 Å, Mg II 2,796 Å, 2,803 Å, and Fe II* 2,612 Å, 2,626 Å are tied to the same
velocity and width and fitted with a single component. The continuum fits to each spaxel
are used to subtract the stellar continuum around [O II] or Mg II to produce the linemaps
shown in Figs. 1–3. The [O II] linemaps have a limiting 1σ surface brightness per pixel
of 1.0 × 10⁻¹⁸ erg s⁻¹ cm⁻² arcsec⁻². For display purposes only, these maps are
interpolated to a grid ten times finer and the ±300 km s⁻¹ maps are clipped at 1.5% of
peak flux (or 4σ).
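
The conversions between velocity and wavelength used for the linemaps can be sketched as follows. This is an illustration only, not the IFSRED or IFSFIT implementation: it assumes the non-relativistic Doppler relation, the nominal doublet wavelengths of 3,726 Å and 3,729 Å, and the systemic redshift z = 0.45916 quoted in these Methods.

```python
C_KM_S = 299_792.458
Z_SYS = 0.45916                      # systemic redshift quoted in the Methods
OII_REST = (3726.0, 3729.0)          # nominal doublet wavelengths from the text (angstroms)
OII_MEAN = sum(OII_REST) / 2.0       # doublet average wavelength


def velocity_separation_km_s(delta_lambda: float, wavelength: float) -> float:
    """Velocity equivalent of a wavelength offset (non-relativistic)."""
    return C_KM_S * delta_lambda / wavelength


def observed_window(v_lo_km_s: float, v_hi_km_s: float) -> tuple:
    """Observed-frame wavelength interval for a velocity range about the doublet centroid."""
    centre = OII_MEAN * (1.0 + Z_SYS)
    return (centre * (1.0 + v_lo_km_s / C_KM_S), centre * (1.0 + v_hi_km_s / C_KM_S))


def rest_frame_range(obs_lo: float, obs_hi: float) -> tuple:
    """Rest-frame coverage corresponding to an observed wavelength range."""
    return (obs_lo / (1.0 + Z_SYS), obs_hi / (1.0 + Z_SYS))


doublet_dv = velocity_separation_km_s(2.6, OII_MEAN)   # quoted as 200 km/s
window = observed_window(-300.0, 300.0)                # the +/-300 km/s linemap
coverage = rest_frame_range(3435.0, 5525.0)            # KCWI wavelength coverage

print(f"doublet separation: {doublet_dv:.0f} km/s")
print(f"+/-300 km/s window: {window[0]:.1f} to {window[1]:.1f} angstroms")
print(f"rest-frame coverage: {coverage[0]:.0f} to {coverage[1]:.0f} angstroms")
```

The last line gives a rest-frame range close to the 2,350–3,790 Å quoted above.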

The core spectrum yields detections of [O II], [Ne V] 3,426 Å, Mg I
2,852 Å, Mg II 2,796 Å, 2,803 Å, and Fe II* 2,612 Å, 2,626 Å in emission,
and Fe II 2,586 Å in absorption. The emission lines break into two distinct
components: a narrow feature at the systemic velocity of the host galaxy
(z = 0.45916; σ = 143 km s⁻¹ in [O II], 197 km s⁻¹ in Mg and Fe emission) and a
broad, blueshifted feature that is outflowing (z = 0.45736, v = −540 km s⁻¹,
σ = 500 km s⁻¹ in [O II]; z = 0.45666, v = −750 km s⁻¹, σ = 392 km s⁻¹ in [Ne V]).
We correct spatially integrated line fluxes and luminosities upward for
a Galactic extinction of A_V = 0.2075, which corresponds to a 22%
correction at [O II]. We use the 2018 Planck cosmology to calculate
luminosity and angular size distances. The spatially integrated [O II] flux
is 4.0(+0.2) x10* ergs cm”. From the core spectrum we measure a 
Mg II flux of 2.3( + 0.1) x 10° erg s ‘cm’, which corresponds to a rest- 
frame equivalent width of 2.3 A anda luminosity of 1.5 x 10“ ergs. The 
[Ne V] line flux is 4.3(+1.2) x 10’ erg stem. 


Ionized and neutral gas properties 

We parameterize the velocity distribution of the [O II]-emitting gas in
each spaxel using the cumulative velocity distribution function (CVDF).
In spaxels where only one component is fitted, this is a Gaussian; for two
components, the CVDF is the sum of two independent Gaussians. We
use the 50th and 98th percentiles of the CVDF (v50% and v98%, as measured
from the red side of the line) to represent the mean and most
blueshifted (‘maximum’) velocities. We define the width of the CVDF
AS O= (V344—Vegy,)/2, which for a single component is the usual Gaussian o. 
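
As an illustration of the percentile definitions above, the following minimal Python sketch builds the CVDF of a two-Gaussian line profile and solves for v50% and v98% by bisection. The component velocities and widths are the [O II] core-spectrum values quoted in the text; the equal relative fluxes, the function names and the bisection bracket are assumptions made for this example and are not taken from the fitting software used in the paper.

```python
import math


def gaussian_cdf(v, mean, sigma):
    """Cumulative distribution of one Gaussian velocity component at v (km/s)."""
    return 0.5 * (1.0 + math.erf((v - mean) / (sigma * math.sqrt(2.0))))


def flux_fraction_redward(v, components):
    """Fraction of the total line flux at velocities redder (larger) than v.

    components is a list of (flux, mean, sigma) tuples.
    """
    total = sum(flux for flux, _, _ in components)
    red = sum(flux * (1.0 - gaussian_cdf(v, mean, sigma))
              for flux, mean, sigma in components)
    return red / total


def percentile_velocity(fraction, components, lo=-1.0e4, hi=1.0e4):
    """Velocity at which the flux fraction counted from the red side equals fraction."""
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if flux_fraction_redward(mid, components) < fraction:
            hi = mid  # too little flux redward of mid: move towards the blue
        else:
            lo = mid
    return 0.5 * (lo + hi)


# Narrow and broad [O II] components; the equal fluxes are hypothetical.
components = [(1.0, 0.0, 143.0), (1.0, -540.0, 500.0)]
v50 = percentile_velocity(0.50, components)
v98 = percentile_velocity(0.98, components)
print(f"v50% = {v50:.0f} km/s, v98% = {v98:.0f} km/s")
```

For a single Gaussian the same routine returns the mean for v50% and a velocity about 2.05σ blueward of the mean for v98%.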

We combine our KCWI data with two other spectra to constrain the
integrated gas excitation, reddening and gas mass. The first is the SDSS
spectrum that, along with [O II], covers the [Ne III] 3,869 Å, Hβ, and
[O III] 4,959 Å, 5,007 Å emission lines. The second is a spectrum acquired
with Keck/NIRSPEC, which covers the Hα, [N II] 6,548 Å, 6,583 Å, and
[S II] 6,717 Å, 6,731 Å lines. The latter was observed with a 0.76”-wide slit
at a position angle of 83° east of north. We scale the NIRSPEC data to
match the SDSS spectrum where they overlap and correct for Galactic
extinction as above. Because of the lower signal-to-noise ratio in these
spectra compared to the KCWI spectrum, we fix the velocities and
linewidths of each emission line using the fit to the KCWI core spectrum.
We measure an Ha flux of 1.9(+0.1) x 10°’ erg s' cm“ and an extinction 
of E(B-V) = 0.4(±0.2) from the Balmer decrement. We then scale this flux
upward to account for the entire nebula, since the [O II] flux within the
SDSS aperture is 24% of the total. Using the estimated gas density of
200 cm⁻³ from the [O II] flux ratio and correcting upward for extinction
by a factor of four yields an ionized gas mass of 6*$ x 10°(200 cm °/n,)M.. 
The extinction uncertainty drives the 50% error, but unquantified
systematic uncertainties in the electron density and [O II]/Hα line ratio
(because we do not spatially resolve these quantities) are likely to be
larger than this.

We show rest-frame optical line flux ratios from the unresolved NIRSPEC
and SDSS spectra in Extended Data Fig. 1. The narrow component is
consistent with photo-ionization by young stars and a near-solar
metallicity, while the broad component is consistent with photo-ionization by
an AGN with ionization parameter U = −2 or shock photo-ionization
with a velocity of at least 300-400 km s⁻¹. The higher excitation of the
broad, outflowing component is also illustrated in the increasing [O III]/
[O II] ratio with increasing blueshift (Extended Data Fig. 2). Besides the
ionized gas lines, Extended Data Fig. 2 shows the absorption-line outflow
in Fe II 2,586 Å; the corresponding Mg II emission that is systemic with a
slight red wing (typical for the emission component of a resonant-line
profile in a neutral outflow but without the usual absorption); and
the broad, high-velocity wings of the CO(2-1) profile.

Other high-ionization lines arise in the observed-frame optical part
of the spectrum. Notably, the [Ne V] emission is spatially unresolved
and found only in the outflowing component, with v98% = 1,500 km s⁻¹.
Whereas [Ne V] is typically used as an AGN indicator, its (extincted)
luminosity in Makani, 3.6(+1.0) x 10*° erg s™, is three times lower than
the average for typical [Ne v] emitters detected at z=0.6-1.2 (ref. ”); it 
may therefore be emitted in shocks*°”, The line ratios in the broad com- 
ponent of the core KCWI and SDSS spectra of log([Ne v]/[Ne 111] 
3,869 A) = -0.7779-8,, log ([O 11]/[O m1]) = -0.11°9-°!, log ([Ne v]/ 
[Om])=-1.0773- and log (He 114,686 A/HB) <-0.87°” are also consist- 
ent with either AGN photo-ionization” or ionization in shocks with 
velocities**“*+* of at least 300-400 kms7. 


ALMA observations, data reduction and analysis 

Makani was observed by the ALMA 12-m array as part of projects 
2016.1.01072.S and 2017.1.01318.S on 11 March 2017, 10 April 2018 and 
15 December 2017 in antenna configurations C40-1 (baselines 15-287
m), C43-3 (baselines 15-500 m) and C43-6 (baselines 15-2,517 m),
respectively. We used the Band 4 receivers with a representative fre- 
quency of 158.01 GHz to detect CO(2-1) at the redshift of the target. 
The total integration time on source was 212 min. The average precipi- 
table water vapour column during observations was approximately 2.5 
mm and the average system temperature was approximately 75 K. The 
atmospheric, bandpass, pointing, phase and flux calibrators included 
the sources J2148+0657, J2134-0153 and Neptune. 
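
The representative frequency quoted above follows from redshifting the CO(2-1) line. A minimal Python check, assuming the standard CO(2-1) rest frequency of 230.538 GHz and a redshift of z = 0.459 (the function name is illustrative):

```python
NU_REST_CO21_GHZ = 230.538  # rest frequency of the CO(2-1) transition in GHz


def observed_frequency_ghz(nu_rest_ghz, z):
    """Observed frequency of a line emitted at nu_rest_ghz by a source at redshift z."""
    return nu_rest_ghz / (1.0 + z)


nu_obs = observed_frequency_ghz(NU_REST_CO21_GHZ, z=0.459)
print(f"CO(2-1) is observed near {nu_obs:.2f} GHz")
```

The result falls in ALMA Band 4 and agrees with the representative frequency to within about 0.01 GHz.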

We use the quality-checked ALMA pipeline-calibrated products,
concatenating the observations into a single measurement set. We image the
data using CASA (version 5.1.0-74), producing three versions of the data
cube: naturally weighted, 1” tapered and 0.6” restored. The last uses
a circular Gaussian restoring beam with a full-width at half-maximum
(FWHM) of 0.6” to match the seeing of the KCWI data. We produce a
tapered image in order to maximize sensitivity to potentially extended
but weak CO emission around the target. We produce clean cubes by first
generating dirty cubes and assessing the r.m.s. noise per channel in each
version. This value is then used in an iterative cleaning step (CASA clean)
where we set a cleaning threshold of 3σ, chosen to balance producing a clean
image against ensuring that real faint extended structure is not removed.
We use multi-scale cleaning with scales of 0”, 0.4”, 0.8” and 1.6”. The
FWHM of the synthesized clean beams in the naturally weighted and tapered
images are 0.39” × 0.31” (position angle PA = −73° east of north) and
1.15” × 1.08” (PA = −79°), respectively. The beam-restored image by
definition has a circular beam of FWHM = 0.6”. We produce data cubes with a
spectral resolution of 30 km s⁻¹ (16 MHz), and image the full spectral
coverage including basebands placed to measure continuum emission at 2 mm in
line-free regions. The r.m.s. (1σ) noise per 30 km s⁻¹ channel in the
natural, tapered and restored cubes is 0.13 mJy beam⁻¹, 0.20 mJy beam⁻¹ and
0.15 mJy beam⁻¹, respectively.
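
The correspondence between the 30 km s⁻¹ velocity channels and their 16 MHz frequency width can be checked with the radio Doppler relation Δν = ν Δv / c. A short Python sketch, assuming the 158.01 GHz representative frequency as the reference (the function name is illustrative):

```python
C_KMS = 299_792.458  # speed of light in km/s


def channel_width_mhz(nu_obs_ghz, dv_kms):
    """Frequency width (MHz) of a velocity channel dv_kms at observed frequency nu_obs_ghz."""
    return nu_obs_ghz * 1.0e3 * dv_kms / C_KMS


width = channel_width_mhz(158.01, 30.0)
print(f"A 30 km/s channel spans {width:.1f} MHz")
```

The value is slightly below 16 MHz, consistent with the rounded figure quoted in the text.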

After examining the cubes and extracting spectra, we detect a weak
2 mm continuum component to the observed emission: averaged over
143.5-146.5 GHz, with a total flux density of 26(±10) mJy. By collapsing
the cube over this frequency range we construct a continuum image
that is subtracted from the channels spanning the CO(2-1) line. CO(2-1)
emission is observed out to a high velocity of v = ±1,500 km s⁻¹ in the
total spectrum, similar to the maximum velocities in the [O II] nebula.
Therefore, to measure the line luminosity we first average the tapered
cube over v = ±1,500 km s⁻¹ and define a 3σ mask for the source extent,
based on the noise in the channel-averaged map. This mask is used to
integrate the spectrum, measuring SΔv across different velocity ranges.
We define a second mask where regions with signal exceeding 15σ in
the collapsed tapered image are excluded, eliminating the contribution
from the bright core and eastern tidal arm. The rationale for this
is to provide an estimate of the CO emission associated with extended
(possibly outflowing) material.

Line luminosities are calculated in the conventional radio units of
K km s⁻¹ pc² as L′ = 3.25 × 10⁷ D_L² (1+z)⁻¹ ν_rest⁻² SΔv, where D_L is the
luminosity distance in Mpc, ν_rest is the rest-frame frequency of the line in
GHz and SΔv is the integrated line flux in Jy km s⁻¹. CO line luminosities
are converted to estimates of the molecular gas mass M using M = αL′. We
quote gas masses including the highest signal-to-noise features, but masking
them lowers these masses by factors of <2. As in previous work, we adopt
α = 0.34 M☉ (K km s⁻¹ pc²)⁻¹, which is lower than both the standard Galactic
and ULIRG conversions because extended and/or high-velocity CO emission might
be optically thin if it is tracing molecular gas in a turbulent outflow.
This provides a conservative estimate of the molecular gas mass.
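
The conversion from integrated line flux to gas mass described above can be sketched in a few lines of Python. The input flux and luminosity distance below are hypothetical placeholders chosen only to exercise the formula; they are not measurements from this work, and the function names are illustrative.

```python
def co_line_luminosity(s_dv_jy_kms, d_l_mpc, z, nu_rest_ghz):
    """Line luminosity L' in K km/s pc^2 from an integrated flux in Jy km/s."""
    return 3.25e7 * s_dv_jy_kms * d_l_mpc**2 / ((1.0 + z) * nu_rest_ghz**2)


def molecular_gas_mass(l_prime, alpha=0.34):
    """Gas mass in solar masses for a conversion factor alpha in Msun (K km/s pc^2)^-1."""
    return alpha * l_prime


# Hypothetical inputs: 1 Jy km/s of CO(2-1) at z = 0.459, D_L = 2,600 Mpc.
l_prime = co_line_luminosity(1.0, 2600.0, 0.459, 230.538)
mass = molecular_gas_mass(l_prime)
print(f"L' = {l_prime:.2e} K km/s pc^2, M = {mass:.2e} Msun")
```

Because the mass scales linearly with α, adopting a Galactic or ULIRG conversion factor instead would raise the estimate in direct proportion.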


Size measurements 

A Sérsic fit to the Hubble Space Telescope image of Makani yields an
effective radius R_e = 2.24 kpc for a Sérsic index of n = 4. For a pure Sérsic
profile, R_e is equivalent to the stellar half-light radius r_1/2, or the radius
within which half of the stellar light arises. The substantial extended,
asymmetric tidal structure in a merger like Makani will affect the
determination of any Sérsic component, although in this case it appears not
to be a large effect; a direct measure of the half-light radius from
integration of the stellar light yields r_1/2 ~ 2.75 kpc. We take the average of
these estimates, 2.5 kpc, to be the half-light radius.

Makani has a peaked core that is well interior of the half-light radius.
An estimate of its size is the radial width at half-maximum of the radial
light profile. This measure yields a core radius of 400 pc, within which
10% of the galaxy’s stellar light resides. This radius is comparable to
other starbursts and post-starbursts without extended tidal structure.

For comparison, Extended Data Fig. 3 shows the radial profile of the
[O II] nebula, determined from azimuthal averages over pixels in bins of
radial width 2 kpc. Integrating over the nebula from the centre outward
as a fraction of the total flux within 50 kpc yields a half-light radius in
[O II] of 17 kpc. The short (east-to-west) and long (north-to-south) axis
profiles, averaged in the direction perpendicular to each profile over
bins 2 kpc wide, decrease less steeply, with maximum nuclear distances
of about 40 kpc and 50 kpc along the short and long axes, respectively.
When doubled, these yield the quoted size of 100 kpc × 80 kpc. These
measurements approach the size of the KCWI field of view, from which
we infer that the nebula could be larger.
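
A half-light radius of this kind can be obtained from an azimuthally averaged profile by accumulating the flux in annuli and finding where the running total reaches half of the flux inside the outer radius. The Python sketch below assumes a uniform surface brightness within each annulus; the exponential test profile and its 10 kpc scale length are hypothetical and do not represent the measured [O II] profile.

```python
import math


def half_light_radius(edges_kpc, surface_brightness):
    """Radius enclosing half of the flux inside the outermost bin edge.

    edges_kpc holds N+1 annulus edges; surface_brightness holds the N mean
    values per annulus. Assumes a positive total flux.
    """
    inner, outer = edges_kpc[:-1], edges_kpc[1:]
    fluxes = [sb * math.pi * (r_out**2 - r_in**2)
              for sb, r_in, r_out in zip(surface_brightness, inner, outer)]
    target = 0.5 * sum(fluxes)
    enclosed = 0.0
    for flux, r_in, r_out in zip(fluxes, inner, outer):
        if enclosed + flux >= target:
            # Enclosed flux grows linearly with area inside a uniform annulus.
            frac = (target - enclosed) / flux
            return math.sqrt(r_in**2 + frac * (r_out**2 - r_in**2))
        enclosed += flux
    return edges_kpc[-1]


# 2-kpc-wide annuli out to 50 kpc with a hypothetical exponential profile.
edges = [2.0 * i for i in range(26)]
profile = [math.exp(-(r + 1.0) / 10.0) for r in edges[:-1]]
print(f"half-light radius = {half_light_radius(edges, profile):.1f} kpc")
```

For a uniform profile the routine returns the outer radius divided by √2, which provides a simple sanity check.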


Stellar mass estimation 

We estimate the stellar mass of Makani using the Bayesian stellar
population synthesis modelling code Prospector and the Flexible Stellar
Population Synthesis (FSPS) models (Extended Data Fig. 4). We
assemble the spectral energy distribution at rest-frame wavelengths
between 0.1 μm and 15 μm from the Galaxy Evolution Explorer, the
SDSS, the Spitzer Space Telescope and the Wide-field Infrared Survey
Explorer (WISE). We adopt a Salpeter initial mass function over the
range (0.1-100) M☉ and assume a ‘delayed τ’ backbone star-formation
history (τ is the e-folding star-formation timescale) with a late-time burst of
star formation superimposed. We assume a power-law dust attenuation 
curve (proportional tod °’) and allow differential attenuation between 
the light from young stars relative to the diffuse interstellar medium.
Finally, we compute the infrared spectrum using energy balance
arguments and basic assumptions about the re-radiated infrared spectrum.
The median value of the marginalized posterior probability for stellar
mass is log(M_*/M☉) = 11.07 with an interquartile range of 10.98-11.14. To
account for systematic uncertainties in the star formation history and
other prior parameters we adopt an average stellar mass and uncertainty
of log(M_*/M☉) = 11.1(±0.2).

We assume that the mid-infrared dust emission in Makani arises from
star formation in order to fit the spectral energy distribution. WISE
mid-infrared colours, W1 − W2 = 0.74(±0.03) and W2 − W3 = 3.64(±0.08) in
Vega magnitudes, place this galaxy in a region occupied partly by
starbursts, but also characteristic of obscured AGN in merging
galaxies. The present data do not distinguish between these possibilities.


Stellar continuum modelling 

To obtain the best constraints on the young stellar populations in
Makani we fit its rest-frame ultraviolet-optical spectrum with stellar
population synthesis models. This fitting is very sensitive to both the
quality of the spectrophotometry and the strong stellar absorption
lines in the 3,700-5,000 Å range. Since the KCWI spectrum does not
extend redwards of rest-frame 3,800 Å, we use a spectrum obtained with
the Blue Channel Spectrograph on the MMT with a 1” slit. To further
extend the wavelength coverage, we join the MMT and SDSS spectra
near 4,600 Å. These spectra have similar spectral resolutions (R = 1,500).
The combined spectrum (Extended Data Fig. 5) matches the SDSS ugriz
photometry well, indicating good spectrophotometric calibration.

We fit the MMT+SDSS spectrum with a combination of simple stellar
population models and the Salim attenuation curve. We use FSPS to
generate simple stellar populations with Padova 2008 isochrones, a
Salpeter initial mass function, and a new theoretical stellar library C3K
(C. Conroy et al., manuscript in preparation) with a resolution of R =
10,000. We use solar-metallicity simple stellar population templates
with 42 ages spanning 1 Myr to 7.9 Gyr. We perform the fit with the
Penalized Pixel-Fitting (pPXF) code. Because the galaxy is very compact
and much of its dust is likely to be in the outflow, we require all stellar
populations to share the same attenuation. The best-fitting model has
z = 0.4590, a stellar velocity dispersion σ = 170 km s⁻¹, and E(B-V) = 0.19.
The spectrum is dominated by a mixture of young and intermediate-age
stellar populations, with approximately 50% of the continuum emission
at rest-frame 5,500 Å contributed by populations less than 7 Myr old.
An additional 40% comes from a 0.4-Gyr-old stellar population. This
implies two major starburst episodes, with the 0.4-Gyr burst perhaps
corresponding to the first passage of the merger and the recent burst to
the final coalescence. The 10-Myr-averaged star-formation rate inferred
from the simple stellar population modelling is 175 M☉ yr⁻¹, after
converting to a Chabrier initial mass function.


Data availability 


Raw data generated at the Keck Observatory are available at the Keck 
Observatory Archive (https://koa.ipac.caltech.edu/) following the stand- 
ard 18-month proprietary period after the date of observation. This 
paper makes use of the ALMA data ADS/JAO.ALMA#2016.1.01072.S and 
ADS/JAO.ALMA#2017.1.01318.S, which are available at the ALMA Science 
Archive (https://almascience.nrao.edu/aq/). Some of the data presented 
here were obtained from the SDSS (https://www.sdss.org). The Hubble 
Space Telescope observations described here were obtained from the 
Hubble Legacy Archive (https://hla.stsci.edu/). Derived data supporting 
the findings of this study are available from the corresponding author 
upon request. 
Extended Data Fig. 1 | Line ratio diagrams of the core spectrum. In a, the
are 1σ. The red narrow component is consistent with star formation at near-solar
metallicity, while the broad, outflowing component is ionized by either an AGN
or high-velocity shocks.
Microfluidic systems are now being designed with precision as miniaturized fluid
manipulation devices that can execute increasingly complex tasks. However, their 
operation often requires numerous external control devices owing to the typically 
linear nature of microscale flows, which has hampered the development of integrated 
control mechanisms. Here we address this difficulty by designing microfluidic 
networks that exhibit a nonlinear relation between the applied pressure and the flow 
rate, which can be harnessed to switch the direction of internal flows solely by 
manipulating the input and/or output pressures. We show that these
networks, implemented using rigid polymer channels carrying water, exhibit an
experimentally supported fluid analogue of Braess’s paradox, in which closing an
intermediate channel results in a higher, rather than lower, total flow rate. The
harnessed behaviour is scalable and can be used to implement flow routing with
multiple switches. These findings have the potential to advance the development of
built-in control mechanisms in microfluidic networks, thereby facilitating the creation
of portable systems and enabling novel applications in areas ranging from wearable
healthcare technologies to deployable space systems.


Fulfilment of the promise of microfluidics to operate as autonomous
microscale networks in which fluids can be transported, mixed, reacted,
separated and processed is no longer limited by experimental fabrication
challenges, but rather by difficulties in creating built-in controls.
The importance of this limitation can be appreciated by noting that the
development of the modern microelectronics that form the basis of
computer microprocessors was ultimately determined by the creation
of integrated circuits, with all components fabricated on the same
substrate. Microfluidics have already reached a level of integration in which
networks with thousands of components, including control devices,
are built on a single compact chip. However, in contrast with electronic
integrated circuits, existing on-chip fluid control devices still need to
be actuated externally. For example, microfluidic circuits fabricated
from flexible polydimethylsiloxane (PDMS) can now incorporate a large
number of control valves, which nevertheless have to be operated using
control fluids through a control layer that lies on top of the working fluid
network. As a result, microfluidics are still predominantly controlled
by external hardware, despite substantial efforts over the past 20 years
to develop systems with new control schemes. The construction of
systems that forgo the current reliance on external hardware is crucial to
further the development of portable microfluidic systems for pressing
applications, ranging from point-of-care diagnostics and health monitoring
wearables to analysis kits for field research. This requires developing
next-generation integrated circuits in which not only the control
devices but also the operation of those devices is integrated on-chip. The
development of such a level of integration has been fundamentally limited
by the fact that, at the microscale, fluid flows tend to respond linearly
to pressure changes and thus cannot be easily amplified or switched.


In this Article, we explore new physics that emerges by combining
network theory and fluid mechanics to induce nonlinear behaviour
in microfluidics and effectively create a passive two-terminal
flow-switch device that is entirely operated on-chip, directly by the
working fluid. Previous work that has achieved built-in control capabilities
(often externally actuated), including oscillatory flows and flow rate
regulation, generally relied on flexible membranes and surfaces.
Microfluidics with such flexible components require flows with very
low Reynolds numbers, a regime in which fluid inertia, and thus the
only nonlinear term of the Navier-Stokes equations for incompressible
fluids, becomes negligible. This has led researchers to often discount the
potential effects of fluid inertia on the flows (as reviewed, for example,
in refs. 7/72). Recent work has shown, however, that inertial forces can
serve as a powerful on-chip tool to manipulate microfluidic dynamics
locally, including shaping streamlines, mixing fluids and
directing particles. Here, we present networks designed to amplify
inertial effects by incorporating properties of porous media that can be
used for non-local fluid routing and manipulation of output patterns.

Figure 1a shows a schematic representation of a microfluidic system
with the fundamental network structure we consider. It consists of
five segments arranged as two parallel channels connected by a linking
channel, where the inlets are kept at a common pressure P_in and
the outlets are held at a common, lower pressure P_out. One of the
outlet channels is modified to generate a nonlinear pressure-flow
relationship, which is achieved by introducing an array of cylindrical
obstacles. Our principal results are supported by theory, simulations
and experiments, and they show that we can: (i) induce a flow direction
switch through the linking channel solely by varying the pressure
Fig. 1 | System schematics. a, Microfluidic network consisting of two parallel
channels, joined by a linking channel, that connect high- and low-pressure fluid
reservoirs. Grey filled circles represent stationary cylindrical obstacles. The
labels denote pressures (P), channel lengths (L) and flow rates (Q), with arrows
indicating the positive flow direction. b, Generic multiswitch microfluidic
network consisting of an array of parallel channels interconnected by multiple
linking channels. A subset of channel segments contain cylindrical obstacles.
Flow is driven through the network by a single pressure difference (P_in − P_out).


difference between the inlets and outlets; and (ii) identify a pressure
difference above which the total flow rate between the inlets and outlets
increases on closing the linking channel. We also predict negative
conductance transitions when the linking channel is equipped with
an offset fluidic diode, which are transitions associated with
non-monotonic pressure-flow relations analogous to those previously
realized using flexible diaphragm valves. The counter-intuitive
behaviour described in (ii) is formally equivalent to the so-called
Braess’s paradox originally established for traffic networks, where
closing a shortcut road has the possible effect of increasing net traffic
flow. We demonstrate integration of the flow switch described in (i)
by considering larger microfluidic networks, as illustrated in Fig. 1b,
which incorporate multiple linking channels and are thus capable
of exhibiting multiple flow switches. Flows through these networks
are driven by a single pressure difference and yet can be designed to
exhibit a variety of flow states by programming the pressure at which
each flow switch occurs.


System design and nonlinearity 

We consider conditions under which all channel segments have the same 
width w, the working fluid is water, and all surfaces (including obstacles) 
have no-slip boundaries. We assume, without loss of generality, that the 
pressure P_out at the outlets is zero, and consider scenarios in which either
the static or the total pressure is controlled at the inlets (Methods). We 
examine two network configurations of the system in Fig. 1a: the con- 
nected configuration, in which the two parallel channels are allowed 
to exchange fluid through the linking channel; and the disconnected 
configuration, in which the linking channel is closed or removed. In 
our theoretical analysis and simulations, the flows are assumed to be 
two-dimensional, yet the main results carry over to three dimensions, 
as verified in our experiments. 

For a straight microfluidic channel of length L ≫ w without obstacles,
an approximate steady-state solution of the Navier-Stokes equations
in two dimensions yields a linear relation between the total volumetric
flow rate per unit depth Q and the pressure drop ΔP along the channel:
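
For reference, the standard plane Poiseuille solution for a two-dimensional channel of width w and length L gives a linear relation of this type. It is written here under the assumption that ΔP denotes the (negative) pressure change along the flow direction, so that −ΔP is the pressure loss; the article's own displayed equation may use a different but equivalent arrangement.

```latex
Q = -\frac{w^{3}}{12\,\mu L}\,\Delta P
```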
Fig. 2 | Development of nonlinear flow. a, b, Simulated flow in a channel with
obstacles (open circles), showing no recirculation for low Re (a) and noticeable
recirculation near the obstacles for larger Re (b). c, d, Experimentally observed
flows around the obstacles (filled circles), visualized using pictures of
fluorescent particles (marked in pink). The particle tracks trace the underlying
flow structure, confirming the development of recirculation regions (white
areas) as Re is increased from low (c) to moderate values (d). e, f, Experimentally
measured relation between pressure loss and Re for a channel with (e; red curve)
and without (f; blue curve) obstacles. The dashed line in e is a reference to guide
the eye and indicates an approximately quadratic relation between pressure loss
and flow rate.


where μ is the dynamic viscosity of the fluid. To induce deviations from
this linear regime, we consider the effect of introducing multiple
stationary obstacles in the channel. Figure 2a, b shows simulations of the
Navier-Stokes equations for a channel with ten cylindrical obstacles
of radius r = w/5 (Methods). We observe recirculation regions forming
near the obstacles for sufficiently large Reynolds number Re = 2ρQ/μ,
where ρ is the fluid density. The recirculation regions first appear for
Re of the order of 10, and their number and size depend on Re. These
localized structures are hallmarks of fluid inertia effects (and thereby
of nonlinearity). We investigate how fluid inertia effects compound to
impact the total flow rate by performing simulations across moderate
values of Re when different numbers of obstacles are present. We find
that a nonlinear relation between the pressure drop ΔP and flow rate
Q = μRe/(2ρ) emerges as soon as obstacles are introduced, and that the
nonlinearity becomes more pronounced as the number of obstacles is
increased (Supplementary Information section S3.1 and Supplementary
Fig. 3).
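
To give a sense of scale for the Reynolds number defined above, the following Python sketch converts between Re and the flow rate per unit depth Q for water. The density and viscosity are approximate room-temperature values assumed for illustration, and the function names are not from the article.

```python
RHO_WATER = 998.0  # density of water in kg m^-3 (approximate, about 20 degrees C)
MU_WATER = 1.0e-3  # dynamic viscosity of water in Pa s (approximate)


def reynolds_number(q, rho=RHO_WATER, mu=MU_WATER):
    """Re = 2 rho Q / mu for a flow rate per unit depth q in m^2 s^-1."""
    return 2.0 * rho * q / mu


def flow_rate_per_depth(re, rho=RHO_WATER, mu=MU_WATER):
    """Inverse relation Q = mu Re / (2 rho), in m^2 s^-1."""
    return mu * re / (2.0 * rho)


q_onset = flow_rate_per_depth(10.0)
print(f"Re = 10 corresponds to Q = {q_onset:.2e} m^2/s")
```

Dividing Q by the channel width gives the mean velocity, so for a given Re narrower channels require proportionally faster flow.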

The nonlinearity we observe in the relation between ΔP and Q conforms
to the Forchheimer effect in porous media, which characterizes
flow through many interconnected microchannels when local inertial
effects at the points of interconnection are non-negligible, even for
laminar flow. We use the Forchheimer equation to derive a relation
between ΔP and Re for the channel with obstacles, given by
where α is the reciprocal permeability and β is the non-Darcy flow
coefficient, both depending solely on the system geometry (Methods).
The physical mechanism giving rise to this nonlinearity is the increase
in flow recirculation and velocity gradients for larger Re, as evidenced
in Fig. 2a, b for Re = 1 and 220. To test the impact of the inertial effects
Fig. 3 | Braess’s paradox and flow switching. Simulation results for the
connected and disconnected configurations of the system for a range of inlet
pressures P_in. The flow rates are presented as a percentage of the total flow rate
through the connected system, Q_C, where we adopt the sign convention for the
flow directions as defined in Fig. 1a. The flow through the linking channel
switches direction at the critical pressure P_in = P_in*, which coincides with the onset
of negative ΔQ that marks the occurrence of Braess’s paradox.


in realistic systems, we perform experiments using microchannels
fabricated from stiff PDMS (hardened by curing). Figure 2c, d shows
experimental evidence of the increase in the number and size of
the recirculation regions with Re, in agreement with our simulations.
An approximately linear relation between −ΔP/Re and Re (and
thus an approximately quadratic relation between −ΔP and Q) for
a channel containing 20 obstacles is shown in Fig. 2e, which contrasts
with the constant relation measured for a channel without obstacles
in Fig. 2f.
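
The linear dependence of −ΔP/Re on Re can be illustrated with the textbook one-dimensional Forchheimer law, −ΔP/L = αμV + βρV², where V is the mean velocity. The Python sketch below is an assumption-laden illustration, not the article's derived relation: it takes V = Q/w with Q = μRe/(2ρ), and the values of α, β, L and w are hypothetical.

```python
def pressure_loss(re, alpha, beta, length, width, rho=998.0, mu=1.0e-3):
    """Pressure loss -dP (Pa) from the Forchheimer law for a given Reynolds number.

    alpha is a reciprocal permeability (m^-2) and beta a non-Darcy
    coefficient (m^-1); length and width are in metres.
    """
    v = mu * re / (2.0 * rho * width)  # mean velocity from Re = 2 rho Q / mu, Q = v * width
    return length * (alpha * mu * v + beta * rho * v**2)


# Hypothetical geometry and coefficients, chosen only for illustration.
params = dict(alpha=5.0e8, beta=2.0e4, length=5.0e-3, width=5.0e-4)
for re in (1.0, 50.0, 100.0, 200.0):
    ratio = pressure_loss(re, **params) / re
    print(f"Re = {re:5.0f}   -dP/Re = {ratio:.4f} Pa")
```

Setting β to zero removes the quadratic term and makes −ΔP/Re constant, which mirrors the behaviour reported for the channel without obstacles.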


Switching and Braess’s paradox 
We incorporate the channel segment with obstacles characterized above
into a network by considering the microfluidic system presented in Fig. 1a.
We take the common static pressure P_in at the inlets to be the controlled
variable in the system. The total flow rate through the network is now
simply the sum of the flows at the outlets, (Q_4 + Q_5). In Fig. 3, we present
results for this system from direct simulations of the steady-state
solutions of the Navier-Stokes equations. As P_in is increased from zero, the
flow rate through the linking channel Q_3 is initially positive before
changing direction and becoming negative once a critical pressure, defined as
P_in*, is reached (Fig. 3). This flow switch results from the nonlinear change
in pressure loss along the channel segment containing obstacles, which
causes a switch in the sign of the pressure difference along the linking
channel AP, (approximately P, — P,) as the flow rate through the system 
increases with P_in. We define Q_C to be the total flow rate for the connected
system configuration and Q_D to be the total flow rate for the disconnected
system configuration, where both are regarded as functions of P_in.
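
The origin of the flow switch can be illustrated with a lumped-element sketch in which each segment is replaced by a resistance. This is a deliberate simplification and not the Navier-Stokes simulation used in the article: the obstacle-laden outlet segment is given a pressure loss a·q + b·q², all other segments are linear, junction effects are ignored, and every parameter value, unit and function name below is hypothetical. The link carries no flow when the two junction pressures are equal, so the critical inlet pressure is found as the root of their difference.

```python
def junction_pressure_nonlinear(p_in, r_inlet, a, b):
    """Junction pressure in the branch whose outlet segment contains obstacles.

    The inlet segment has linear resistance r_inlet; the outlet segment loses
    a*q + b*q**2 (with b > 0) and discharges at zero pressure.
    """
    lin = r_inlet + a
    q = (-lin + (lin * lin + 4.0 * b * p_in) ** 0.5) / (2.0 * b)  # solves p_in = lin*q + b*q**2
    return a * q + b * q * q


def junction_pressure_linear(p_in, r_inlet, r_outlet):
    """Junction pressure in the branch made of two linear segments."""
    return p_in * r_outlet / (r_inlet + r_outlet)


def link_pressure_difference(p_in, r1, a, b, r2, r5):
    """Pressure difference across the (closed) link; its sign sets the link flow direction."""
    return (junction_pressure_nonlinear(p_in, r1, a, b)
            - junction_pressure_linear(p_in, r2, r5))


def critical_pressure(r1, a, b, r2, r5, lo=1.0e-6, hi=1.0e3):
    """Inlet pressure at which the link pressure difference changes sign (bisection)."""
    f_lo = link_pressure_difference(lo, r1, a, b, r2, r5)
    f_hi = link_pressure_difference(hi, r1, a, b, r2, r5)
    if f_lo * f_hi > 0.0:
        raise ValueError("no sign change in the bracket: no switch for these parameters")
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        f_mid = link_pressure_difference(mid, r1, a, b, r2, r5)
        if f_lo * f_mid <= 0.0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


# Hypothetical resistances in arbitrary units.
p_star = critical_pressure(r1=1.0, a=0.5, b=1.0, r2=1.0, r5=1.0)
print(f"link flow reverses at P_in = {p_star:.3f} (arbitrary units)")
```

In this toy model the junction above the obstacle segment sits at the lower pressure for small P_in and at the higher pressure for large P_in, so an open link would carry flow in opposite directions on either side of the critical pressure. Which direction counts as positive depends on the convention of Fig. 1a, which is not assumed here.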
Figure 3 shows ΔQ = Q_C − Q_D for a range of applied pressures P_in.
Intuition may suggest that ΔQ is positive for all values of P_in because the
linking channel in the disconnected system can be considered to have
an infinite fluidic resistance, while for the connected system configuration
the resistance of the linking channel is finite. Hence, reducing the
resistance of any component of the system may seem to imply that the
total flow rate should increase for fixed P_in. We observe, however, that
ΔQ becomes negative for P_in above the critical pressure that marks the
flow switch, P_in*, meaning that an open linking channel between the parallel
channels results in a lower total flow rate. Figure 3 also shows that
the flow rate through the channel segment with obstacles, Q,, remains 
largely unchanged between the two configurations. Therefore, the 


difference in the total flow rate exists primarily in the difference in Q,, 
and Q; acts as a controlling variable of Q. 

The observation of a lower total flow rate for the connected configuration
compared to the disconnected configuration for fixed P_in is a
manifestation of a fluid analogue of Braess’s paradox. Indeed, if we
consider the disconnected system driven by an inlet pressure P_in > P_in*,
the addition of the linking channel can result in a decrease in the total
steady-state flow rate (as large as 10% in our simulations). The value of
the critical pressure P_in* depends, of course, on the dimensions of the
channels, but we find that the onset of Braess’s paradox and the flow
switch always occur at the same pressure for the range of parameters
investigated. We obtain similar results for Braess’s paradox and flow
switching when instead the total pressure is controlled at the inlets
(Supplementary Information section S3.4). Our observation of Braess’s
paradox and flow switching also has the potential to lead to additional
control features when existing microfluidic components are integrated
into our system. For example, by incorporating an offset fluidic diode
in the linking channel, the system can undergo negative (and positive)
conductance transitions, where an increase in P_in leads to an abrupt
decrease in the total flow rate (Supplementary Information section S4).


Experimental results 

We performed experiments to validate our predictions of flow switching
and Braess’s paradox in a network with dimensions typical of
microfluidics. A schematic of the experimental apparatus is presented in
Fig. 4a, where an open/close valve is used to implement the addition/
removal of the linking channel (Methods). With the valve open, a flow
switch is observed at a critical driving pressure P_in* in the range 5-10 kPa,
as demonstrated in Fig. 4a by images of the flows through the channel
junctions at the end points of this pressure range. (The switching
behaviour has no reliance on the valve, as explicitly shown in Supplementary
Fig. 11.)

A confirmation of Braess’s paradox in this system is shown in Fig. 4b
for driving pressures above P_in*, as observed in our simulations. The
measured total flow rate is higher when the linking channel valve is
closed than when it is open, thus demonstrating the paradox, and the
magnitude of the paradox is observed to be larger for higher driving
pressures. A breakdown of how the flow rate changes in channel segments
4 and 5 individually is shown in Fig. 4c, d. Closing the valve causes
the flow rates through both channels to increase, which is in agreement
with direct simulations and is yet another striking aspect of Braess’s
paradox in this system; it would be, at first, intuitive to expect that
Q_5 would decrease when the in-flow from the linking channel is switched
off. Time series of the flow rates measured as the linking channel is
sequentially opened and closed further illustrate the transitions
underlying the paradox (as shown in Supplementary Fig. 12).

In our experiments, the total pressure is controlled at the inlets and
the experimental results are in full qualitative agreement with
simulations performed under the same pressure boundary conditions
(Supplementary Information section S3.4). This illustrates the robustness
of the phenomenon, given that our simulations are in two dimensions and
three-dimensional effects are expected to be present in the experiments.
We note that different aspects of the paradox have been considered in
fluid networks, but only for macroscopic (that is, non-microfluidic)
systems and while modelled by ad hoc flow equations. Analogues of the
paradox have also been studied in several other areas, including
electrical, mechanical, biological, and contemporary traffic networks.
These examples show that Braess’s paradox is a potentially general
network phenomenon, which has remained unexplored in microfluidic
networks.


Network model 

Fig. 4 | Experimental observation of flow switch and Braess’s paradox.
a, Experimental setup of the system presented in Fig. 1a, with flow
tracking images (insets) at the junctions. An air-pressure pump is used
to equally pressurize two vials containing red and blue dyed water, where
each vial is connected to one of the system inlets. The linking channel
is equipped with an open/close valve and channel 4 contains 20 obstacles.
Images of the dyed flows through the junctions are shown for P_in below
(5 kPa; left insets) and above (10 kPa; right insets) the flow-switching
pressure P_in*, where the flow directions are indicated by the arrows.
b, Total flow rate (Q_4 + Q_5) when the linking channel valve is ‘open’
or ‘closed’ (see diagrams at right) for two different driving pressures
above P_in*. c, d, Breakdown of the total flow rate into Q_4 (c) and
Q_5 (d) for the two states of the valve. The plotted flow rates are
averages derived from time series data, and the error bars indicate one
standard deviation. The observed increase in the total flow rate when the
valve is closed is direct evidence of Braess’s paradox.

To characterize the microfluidic system in Fig. 1a, we construct an
analytic model that captures the flow properties observed in our
simulations and experiments. The model consists of pressure-flow
relations for each channel segment and, crucially, includes the most
dominant term resulting from minor pressure losses at the channel
junctions (Methods). We model the contribution of the latter as an
additive term K(Q_3/Q_5) f(Q_5) to the pressure-flow equation for channel
segment 5, where the scaling factor f and the coefficient K are
increasing functions for P_in > 0 such that f(0) = K(0) = 0. Several
results are obtained from this model for P_in > 0, as assumed throughout.
First, if β = 0 (that is, the quadratic term is zero in equation (2))
when the static pressure is controlled or the dynamic pressure is
negligible, then flow switching does not occur, in agreement with direct
simulations (Supplementary Information section S3.2). Second, when β > 0,
a steady-state solution can be found satisfying Q_3 = 0 provided that the
following geometric condition is satisfied:

[Equation (3): geometric condition on the channel dimensions]

This solution identifies the critical pressure P_in*. Third, for flow
rates in the linking channel, the model predicts that a variation δQ_3 is
negatively related to a variation δP_in around P_in*. This indicates that
P_in above (below) P_in* results in a negative (positive) flow rate
through the linking channel. The first result implies that, in our
experiments, the Forchheimer effect is necessary to achieve a flow
switch. The second and third results, which hold even when dynamic
pressure is non-negligible, show that this model captures the
flow-switching behaviour observed in the simulations and experiments.
Importantly, we validate the flow-switching condition in equation (3) by
demonstrating quantitative agreement between the model and simulations
both when the static pressure and when the total pressure is controlled
(Supplementary Information section S3.2).

The model also predicts Braess’s paradox as observed in our experiments
and simulations. Specifically, under the condition that equation (3) is
satisfied and dynamic pressure is small (or static pressure is
controlled), the model predicts the paradox to occur for δP_in > 0 if and
only if

[Equation (4)]


where a and c are positive parameters and prime denotes derivative. If
total pressure is controlled and dynamic pressure terms are included,
the paradox is also predicted for δP_in > 0 provided that a relation
similar to equation (4) is satisfied (details for both cases are
presented in Supplementary Information section S2). The dependence of
condition (4) on β and K′(0) underlines the crucial roles of nonlinearity
and minor losses in giving rise to Braess’s paradox in our experiments,
and shows in particular that minor losses have to be sufficiently large.
Indeed, if the effect of minor losses is neglected, a manifestation of
Braess’s paradox is still predicted to occur, but with much smaller
magnitude and only for δP_in < 0, which is inconsistent with our
simulations and experiments (Supplementary Information section S2.3).

The result in equation (4) also highlights a fundamental difference
between microfluidic and electronic circuits, namely that minor losses
(that is, energy losses associated with interactions between circuit
components) do not have direct analogues in common electronics. Given
the central role played by such losses in equation (4), we posit that this
difference might be the reason why no equivalent of the Braess paradox
effect we present has been observed in electrical networks, even though
aspects of it have. We further investigated the impact of interactions
between channel segments by varying the junction angles to show that
the paradox can be further enhanced by manipulating the minor losses
(Supplementary Information section S3.3).


Networks with multiple programmed switches 

The system considered thus far can be generalized to create larger
microfluidic networks with multiple flow switches, that is, networks
with multiple disjoint channel segments in which the flow initially in one
direction can be individually ‘switched’ to move in the opposite
direction through the manipulation of one driving pressure alone. In our
design, the linking channel plays the role of a switch (and can be
referred to as such). Figure 1b shows the multiswitch generalization of
the network in Fig. 1a, which incorporates multiple linking channels and
a subset of channel segments with obstacles. We experimentally
demonstrate an instance of a six-switch network that exhibits flow
switching in all linking channels (as presented in Supplementary
Information section S6.2). Multiswitch networks can be designed by
extending the network model presented above.

One such network with ten linking channels is presented in Fig. 5a. By
marking each inlet flow with a different colour, we show that a variety
of patterns can form in the outlet flows (coloured circles in Fig. 5). The
specific pattern at an outlet depends on the order in which the flow
switches occur as P_in is varied. The network model for larger systems
is constructed by combining pressure-flow relations for each channel
segment with flow rate conservation equations for each junction. Using
this model, we can design a network for which each flow switch occurs
near a target value of P_in by optimizing the dimensions of the channel
segments (Methods).

As illustrated in Fig. 5, a set of 11 different internal flow states and 17
unique colour combinations at the outlets are possible for the switching
sequence realized in Fig. 5b. Figure 5c, d shows the agreement between
the model predictions of these flow states and results from direct
simulations of the Navier-Stokes equations. This variety of states (and
output patterns) is achieved with only three channel segments containing
obstacles and is parameterized by a single control variable, the driving
pressure P_in. Moreover, the switching is implemented solely through the
working fluid, which differs from existing approaches that rely on
flexible valves and additional control flows. Thus, multiswitch networks
exhibit several properties exploitable in the design of new controllable
microfluidic systems.

Fig. 5 | Flow patterns in a multiswitch network. a, Schematic of
ten-switch network. Fluids of different colours are driven to each inlet
by a common static pressure source, P_in. The outlets are labelled by
O1–O6 and the linking channels by 1–10. The arrows indicate the flow
direction through each linking channel and multicoloured circles
schematically indicate the fluid composition at each outlet for an
initially low P_in. The segment lengths are denoted by a_i, b_i, c_i, d,
e and f, where a common length is assumed for all linking channels and
the segments with obstacles are marked with filled grey circles.
b, Patterns of outlet flows for the network programmed with a chosen
switching sequence as P_in is increased. Each column of coloured circles
denotes the outlet flows after the corresponding flow switch occurs,
where mixing between different coloured fluids is assumed to occur when
passing through the same channel segment. c, d, Model predictions (c) and
simulation results of the Navier-Stokes equations (d) for the flow rate
through each linking channel for a network designed to exhibit the
switching sequence in b. The flow rates are labelled according to the
channels in a and are divided into two sets (top and bottom panels) for
clarity. Positive flow rates correspond to flow in the upward direction
in a, and each flow switch occurs when the corresponding curve crosses
the horizontal axis. The segment dimensions that give rise to the
particular switching order in b–d are reported in Supplementary Table 1.
All 21 possible outlet flow colour combinations are realized between the
switching sequence presented here and those in Supplementary Fig. 13.


More generally, for a multiswitch network with n_h horizontal channels
interconnected by n_l linking channels, the number of possible internal
flow states is n_l + 1 if each linking channel exhibits a flow switch. In
addition, the possible number of unique colour combinations in the outlet
flows is n_h(n_h + 1)/2 if each inlet flow is marked with a different
colour. All colour combinations can be realized over the set of all
switching sequences, provided that there exist flow paths allowing mixing
of every set of k adjacent colours for k ranging from 1 to n_h. The
myriad states possible in such multiswitch networks underlie their
ability to process inputs into multiple outputs and thus to support
various applications, including implementing different mixing orders of
chemical reagents and devising schemes for the parallel generation of
mixtures with tunable concentrations.
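
The two counting rules above can be checked by direct enumeration. The following is a minimal sketch, not the authors' code: it assumes that each outlet colour combination corresponds to a run of adjacent inlet colours, as the text describes, and the function names are illustrative only.

```python
def internal_flow_states(n_linking_channels):
    """Number of internal flow states if every linking channel switches once."""
    return n_linking_channels + 1


def adjacent_colour_sets(n_inlets):
    """Enumerate every run of k adjacent inlet colours, for k = 1..n_inlets."""
    sets = []
    for k in range(1, n_inlets + 1):
        for first in range(1, n_inlets - k + 2):
            sets.append(tuple(range(first, first + k)))
    return sets


# Ten linking channels and six inlets, as in the network of Fig. 5.
combos = adjacent_colour_sets(6)
print(internal_flow_states(10), len(combos))  # 11 21
```

For six inlets the enumeration gives 21 combinations, matching the closed form 6 × 7 / 2 and the count quoted in the Fig. 5 legend.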


Conclusions and outlook 

The flow switch, conductance transitions and Braess’s paradox
established in this study are all emergent behaviours of common origin
resulting from nonlinearity and interactions between different parts of
the system. The nonlinearity is directly determined by fluid inertia
effects, which can be enhanced and manipulated through the placement of
obstacles and have the advantage of not being reliant on flexible
components, fluid compressibility or dedicated control flows. The onset
of Braess’s paradox is marked by the flow-switching pressure, above which
the increased resistance of the nonlinear channel causes the flow to be
routed in the negative direction through the linking channel. When
constrained by a diode, the switch in flow direction also enables
negative conductance transitions. Our results demonstrate an approach
for routing and switching in microfluidic networks through control
mechanisms that are coded into the network structure, thus responding
to the call for design strategies that allow diverse microfluidic systems
to be assembled from a small set of core components.

Here we considered the scenario in which the inlets and the outlets
are (separately) held at the same pressure, rendering the network a
two-terminal system in all cases, because this is the most stringent
scenario for flow manipulation. If a multi-terminal system is configured,
by allowing the pressures at each of the inlets (and/or outlets) to be
varied independently, then the effects that we presented may be further
enhanced. Finally, although we focused on boundary conditions in which
the inlet pressures are controlled, it would be natural to explore in
future research the scenario in which the controlled variables are the
inlet flow rates. We anticipate, for example, that the negative
conductance transitions would then be converted into pressure
amplification (pressure release) transitions in which the inlet–outlet
pressure difference increases (decreases) abruptly at the transition
point. Accordingly, Braess’s paradox is also expected to take a
complementary form in which closing the linking channel causes the
inlet–outlet pressure difference to drop. Incidentally, it is this
complementary form of Braess’s paradox that has been previously
established for electrical circuits, thus suggesting an additional
correspondence between electronic and microfluidic circuits.
Methods


Navier-Stokes simulations 

The numerical simulations were performed using OpenFOAM version 4.1.
We used meshes with an average cell area ranging from 10 µm² to
340 µm², where the finest meshing was applied near the obstacles. All
meshes were generated using Gmsh. The two-dimensional solutions
were found using the simpleFoam solver within OpenFOAM, employing
second-order numerical schemes, where a fixed static pressure of
zero was set for the boundary conditions at the outlets. At the inlets,
the static (total) pressure was fixed for the static (total) pressure
controlled cases. For simulations of the multiswitch network in Fig. 5,
the same geometry and dimensions were used as for the model predictions,
provided in Supplementary Table 1, and equal driving pressures were
applied at each of the six inlets.


Reynolds numbers 

The characteristic length scale used in defining the Reynolds number of
the flow is the hydraulic diameter of the channels, defined as 4A/P,
where A is the area and P is the perimeter of the channel cross-section
(common to all segments). The hydraulic diameter in two and three
dimensions is 2w and 2wh/(w + h), respectively, where h is the height of
the channels in the three-dimensional case. The characteristic velocity
used in two and three dimensions is Q/w and Q/wh, respectively.
Therefore, we define Re = 2ρQ/μ for our simulations in two dimensions and
Re = 2ρQ/[μ(w + h)] for our experiments in three dimensions. The
undeclared ranges of Re for the channel segment with obstacles considered
in the presented data are: 21–385 (Fig. 3), 12–121 (Fig. 4), 1–220
(Fig. 5), 1–380 (Supplementary Fig. 2), 4–111 (Supplementary Fig. 4),
40–385 (Supplementary Fig. 7), 20–400 (Supplementary Fig. 8), 2–10
(Supplementary Fig. 11b), 75–85 (Supplementary Fig. 11c), 76–89
(Supplementary Fig. 12), 10–20 (Supplementary Fig. 14b) and 110–120
(Supplementary Fig. 14c).
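
As a worked illustration of these definitions, the sketch below evaluates the hydraulic diameter and the two Reynolds-number expressions. The fluid properties are those of water, the cross-section is the PDMS channel size reported in Methods, and the flow rate of 1 ml/min is an assumed example value rather than a measured one.

```python
def hydraulic_diameter_3d(width, height):
    """4A/P for a rectangular cross-section: 2wh/(w + h)."""
    return 2.0 * width * height / (width + height)


def reynolds_2d(rho, mu, q_per_depth):
    """Re = 2*rho*Q/mu, with Q the flow rate per unit depth (m^2/s)."""
    return 2.0 * rho * q_per_depth / mu


def reynolds_3d(rho, mu, q, width, height):
    """Re = 2*rho*Q/(mu*(w + h)), with Q the volumetric flow rate (m^3/s)."""
    return 2.0 * rho * q / (mu * (width + height))


RHO = 1.0e3           # kg/m^3, water
MU = 1.0e-3           # Pa s, water
W, H = 200e-6, 185e-6  # m, PDMS channel cross-section from Methods
Q = 1.0e-6 / 60.0     # m^3/s; 1 ml/min is an assumed example value

re = reynolds_3d(RHO, MU, Q, W, H)
# Cross-check against Re = rho * V * D_h / mu with V = Q/(w*h).
re_check = RHO * (Q / (W * H)) * hydraulic_diameter_3d(W, H) / MU
print(round(re, 1), round(re_check, 1))
```

Both routes give the same value, which confirms that the compact expression is simply ρVD_h/μ with the stated characteristic velocity and length.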


Pressure boundary conditions 

We consider two different boundary conditions for the driving pressure
P_in at the system inlets. Under one condition, total pressure is
controlled and the inlets open directly into a high-pressure reservoir.
Under the other condition, static pressure is controlled and the inlets
are connected to the reservoir by pressure regulators. Total pressure is
the sum of static pressure and dynamic pressure, where dynamic pressure
is defined as ρv²/2 for a fluid with density ρ and velocity v. The
distinction between these boundary conditions is often neglected in the
microfluidics literature when the Reynolds number is less than one, but
it can become important for larger Reynolds numbers (even though the flow
remains laminar).
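
To give a sense of scale for the two boundary conditions, the following sketch computes the dynamic pressure and the corresponding total pressure. The flow rate (1 ml/min) and the static pressure (10 kPa) are assumed example values; the cross-section is the PDMS channel size reported in Methods.

```python
def dynamic_pressure(rho, velocity):
    """Dynamic pressure rho*v^2/2 in Pa."""
    return 0.5 * rho * velocity ** 2


def total_pressure(static_pressure, rho, velocity):
    """Total pressure = static pressure + dynamic pressure."""
    return static_pressure + dynamic_pressure(rho, velocity)


RHO = 1.0e3            # kg/m^3, water
W, H = 200e-6, 185e-6  # m, channel cross-section from Methods
Q = 1.0e-6 / 60.0      # m^3/s; assumed example flow rate (1 ml/min)
P_STATIC = 10.0e3      # Pa; assumed example static pressure

v = Q / (W * H)        # mean velocity through the cross-section
p_dyn = dynamic_pressure(RHO, v)
print(round(v, 3), round(p_dyn, 1), round(p_dyn / P_STATIC, 4))
```

Under these assumed values the dynamic pressure is about one hundredth of the static pressure, so the two boundary conditions differ only slightly at this operating point.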


Pressure-flow relations 

We use equation (1) to describe the pressure-flow relation for straight,
obstacle-free channels; it is derived directly from the Navier-Stokes
equations by assuming plane Poiseuille flow through a two-dimensional
channel. To describe the nonlinear pressure-flow relation observed
for the channel with obstacles, we refer to the Forchheimer equation:
−ΔP = αμLV + βρLV², where V is the average fluid velocity. In two
dimensions, V = Q/w = μRe/(2ρw) and, thus, the Forchheimer equation can
be written in the form of equation (2). In agreement with equation (2), we
find an excellent linear fit between −ΔP/Re and Re for a channel with ten
obstacles, and we validate the fit by predicting flows through the same
channel for a fluid with a different viscosity (Supplementary Information
section S3.1). We observe no unsteady flow through the channel with
obstacles due to vortex shedding for Re of up to 400, as expected for
systems with highly confined obstacles, which permits the use of the
steady-state relation in equation (2) over the range of Re considered
here. We experimentally verify the source of nonlinearity in PDMS
channels with obstacles, which were designed to have approximately square
cross-sections to minimize deformation (which could lead to other forms
of nonlinearity). Through additional experiments, we confirmed that
pressure-flow relations similar to those in Fig. 2e, f hold for channels
constructed from materials with both higher rigidity (SU-8 photoresist)
and lower rigidity (Flexdym) than the PDMS (Supplementary Information
section S5 and Supplementary Fig. 10). We note that porous-like
structures have been previously used both to study non-inertial effects
in microfluidics, such as droplet formation and viscous fingering,
and to study inertial effects in larger systems. In our system, inertial
effects arise at the microfluidic scale even for a much smaller number
of obstacles than the typical number in porous-like materials.
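
The linear fit between −ΔP/Re and Re mentioned above can be illustrated with synthetic data. Substituting V = μRe/(2ρw) into the Forchheimer equation gives −ΔP/Re = αμ²L/(2ρw) + [βμ²L/(4ρw²)]Re, so the intercept and slope of a straight-line fit yield α and β. The sketch below assumes hypothetical values of α and β and noise-free data; it is not the authors' fitting code.

```python
def forchheimer_drop(re, alpha, beta, mu, rho, length, width):
    """Pressure drop -dP = alpha*mu*L*V + beta*rho*L*V^2 with V = mu*Re/(2*rho*w)."""
    velocity = mu * re / (2.0 * rho * width)
    return alpha * mu * length * velocity + beta * rho * length * velocity ** 2


def fit_line(xs, ys):
    """Ordinary least-squares fit y = slope*x + intercept."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def fit_forchheimer(re_values, drops, mu, rho, length, width):
    """Recover alpha and beta from a linear fit of (-dP/Re) against Re."""
    ratios = [dp / re for dp, re in zip(drops, re_values)]
    slope, intercept = fit_line(re_values, ratios)
    alpha = 2.0 * rho * width * intercept / (mu ** 2 * length)
    beta = 4.0 * rho * width ** 2 * slope / (mu ** 2 * length)
    return alpha, beta


MU, RHO = 1.0e-3, 1.0e3          # water
LENGTH, WIDTH = 0.02, 200e-6     # m; illustrative channel size
ALPHA_TRUE, BETA_TRUE = 1.0e8, 1.0e4  # hypothetical coefficients (1/m^2, 1/m)

re_values = [10.0 * i for i in range(1, 41)]  # Re from 10 to 400
drops = [forchheimer_drop(re, ALPHA_TRUE, BETA_TRUE, MU, RHO, LENGTH, WIDTH)
         for re in re_values]
alpha_fit, beta_fit = fit_forchheimer(re_values, drops, MU, RHO, LENGTH, WIDTH)
print(alpha_fit, beta_fit)
```

With measured data the same fit would also expose any departure from the Forchheimer form as curvature in the plot of −ΔP/Re against Re.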


Network flow model construction 

The analytic model used to describe the system in Fig. 1a is constructed
as follows: (i) we consider the pressure at the inlets P_in to be in the
vicinity of P_in*; (ii) we approximate the pressure-flow relation through
the linking channel as linear, with Q_3 equal to the channel conductivity
κ multiplied by an effective pressure difference between the two
junctions, in which a free parameter γ scales one of the junction
pressures; (iii) the flow equation for each other channel segment without
obstacles is written as in equation (1), where −ΔP is the pressure drop
along the segment and L is the segment length; (iv) for the channel
segment with obstacles, we take the flow equation to be in the form of
equation (2) (with Re expressed as 2ρQ/μ); (v) we include the most
dominant term resulting from minor pressure losses at the channel
junctions. Therefore, the model consists of five pressure-flow relations,
in addition to two flow conservation equations, one at each junction,
which balance the linking-channel flow Q_3 against the inlet and outlet
flows of the corresponding parallel channel. When the static pressure is
controlled at the inlets, the only nonlinearity that exists in the model
comes from the Forchheimer term due to the presence of obstacles and the
minor loss term. The model can also be adapted for when total pressure is
controlled by taking the static pressure at each inlet to be
P_in − kρQ²/(2w²), where P_in now denotes total pressure and the
coefficient k is a constant of order unity that only depends on the shape
of the inlet velocity profile (k ≈ 1 for a uniform velocity profile at
the inlet, as considered here). However, the dynamic pressure term
ρQ²/(2w²) is often negligible in real microfluidic systems because of the
high pressures needed to drive fluid through the channels. Indeed, in our
experiments, the dynamic pressure near P_in* was smaller than the static
pressure by two orders of magnitude and smaller than the pressure loss
due to the Forchheimer effect by one order of magnitude. This can also be
seen in Fig. 2f, where a constant relation between Re and ΔP/Re is
measured. Details of the model are presented in Supplementary Information
section S1.
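
The structure of such a model can be illustrated with a deliberately simplified, dimensionless sketch. It is not the authors' model: it omits the minor-loss term, sets the free parameter of the linking channel to one, uses lumped linear resistances for the obstacle-free segments, and all numerical values and names are hypothetical. One branch ends in a segment with a linear-plus-quadratic pressure-flow relation; the other is fully linear.

```python
import math


def obstacle_flow(pressure_drop, a, b):
    """Flow q >= 0 solving a*q + b*q**2 = pressure_drop (Forchheimer-like form)."""
    return (-a + math.sqrt(a * a + 4.0 * b * pressure_drop)) / (2.0 * b)


def solve_network(p_in, r_in_a=1.0, r_in_b=0.2, r_out_b=1.0,
                  a=1.0, b=1.0, kappa=1.0):
    """Return (q_link, q_obstacle, q_plain) for a common inlet pressure p_in.

    Branch A: inlet resistance r_in_a, then the obstacle segment (a, b).
    Branch B: inlet resistance r_in_b, then a linear outlet r_out_b.
    q_link > 0 means flow from junction B into junction A.
    """
    def p_b_given(p_a):
        # Conservation at junction B is linear in p_b and solved in closed form.
        return (p_in / r_in_b + kappa * p_a) / (1.0 / r_in_b + kappa + 1.0 / r_out_b)

    def residual(p_a):
        # Net inflow at junction A; decreases monotonically with p_a.
        return ((p_in - p_a) / r_in_a + kappa * (p_b_given(p_a) - p_a)
                - obstacle_flow(p_a, a, b))

    lo, hi = 0.0, p_in
    for _ in range(200):  # bisection on the junction pressure p_a
        mid = 0.5 * (lo + hi)
        if residual(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    p_a = 0.5 * (lo + hi)
    p_b = p_b_given(p_a)
    return kappa * (p_b - p_a), obstacle_flow(p_a, a, b), p_b / r_out_b


for p in (10.0, 24.0, 40.0):
    q_link, q_obstacle, q_plain = solve_network(p)
    print(p, round(q_link, 4), round(q_obstacle, 4), round(q_plain, 4))
```

With these illustrative values the linking flow is positive at low driving pressure, vanishes at a driving pressure of 24, and is negative above it, which mirrors the flow switch qualitatively. Reproducing Braess's paradox as described in the text would additionally require the minor-loss term, which this sketch leaves out.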


Designing multiswitch networks 

For a network with multiple switches and a given set of channel
dimensions, the value of P_in for which a specific flow switch occurs can
be determined through the addition of a constraint to the model that
enforces the flow through the corresponding linking channel to be zero.
Then, the dimensions of a chosen subset of channel segments may be varied
through an optimization procedure in order to design a network for
which each flow switch occurs near a target value of P_in. Depending on
which dimensions are allowed to be adjusted, the desired relative order
of the switches can be achieved exactly, and the final set of switching
pressures can be very close to the target ones (often <5% difference),
where the former is expected to be more important in applications.
Further details on the design of multiswitch networks are presented
in Supplementary Information section S6.1.
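
The two steps described here, imposing zero flow in a linking channel to find the switching pressure and then adjusting a dimension to hit a target, can be sketched for a single switch. The example below is a toy, dimensionless illustration and not the authors' optimization procedure: it assumes lumped linear resistances, a linear-plus-quadratic obstacle segment with hypothetical coefficients a and b, no minor losses, and it tunes only one inlet resistance (a stand-in for a segment length) by bisection.

```python
def switching_pressure(r_in_a, r_in_b, r_out_b, a, b):
    """Inlet pressure at which the linking flow vanishes, or None if no switch.

    With zero linking flow both junctions share one pressure, which requires
    the obstacle segment to present an effective resistance
    a + b*q = r_out_b * r_in_a / r_in_b.
    """
    required = r_out_b * r_in_a / r_in_b
    if required <= a:
        return None  # the condition for a switch is not met
    q_obstacle = (required - a) / b
    return q_obstacle * r_in_a * (r_in_b + r_out_b) / r_in_b


def tune_inlet_resistance(target, r_in_a=1.0, r_out_b=1.0, a=1.0, b=1.0,
                          lo=0.05, hi=0.95):
    """Bisection on r_in_b so that the switch occurs at the target pressure.

    Relies on the switching pressure decreasing as r_in_b grows within [lo, hi].
    """
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if switching_pressure(r_in_a, mid, r_out_b, a, b) > target:
            lo = mid  # switch occurs too late: increase r_in_b
        else:
            hi = mid
    return 0.5 * (lo + hi)


r_design = tune_inlet_resistance(24.0)
print(round(r_design, 6), switching_pressure(1.0, r_design, 1.0, 1.0, 1.0))
```

For a network with several switches, one such zero-flow constraint would be written per linking channel and the dimensions adjusted jointly, which is where a general-purpose optimizer would replace the single bisection used here.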


PDMS channel fabrication 

The flow channels were assembled by sealing a patterned PDMS chip
against a glass slide. The PDMS chip was made by pouring a mixture of
PDMS oligomer and cross-linking curing agent (Sylgard 184) at a weight
ratio of 10:1 into a mould after degassing under vacuum. The mixture
was cured at 74 °C for 1 h and then peeled off from the mould to yield
the microchannel design. The dimensions of the channels in Figs. 2 and 4
were 200 µm (width) × 185 µm (height), and the diameter of the obstacles
was 97 µm. After punching the holes for inlet and outlet connections, the
PDMS chip was thermally aged at 200 °C for 12 h to reduce
pressure-induced deformation, yielding a chip with a Young’s modulus of
approximately 3 MPa. Both the PDMS chip and the glass substrate were
cleaned with isopropanol and treated by plasma for 90 s before bringing
them into contact. Once the PDMS chip was sealed against the glass slide,
the device was placed in an oven for 30 min at 74 °C to improve bonding
quality.

The mould used was a silicon wafer containing microchannel patterns
created by soft photolithography using a negative photoresist.
A 4-inch silicon wafer (test grade, University Wafer, Boston, MA) was
cleaned with acetone and isopropanol and dried with nitrogen gas. The
wafer was then coated with SU-8 50 negative photoresist (MicroChem
Corp., Newton, MA) on a spin coater (Laurell Technologies Corp., North
Wales, PA) operating at 600 rpm for 30 s. After a pre-exposure bake at
65 °C and subsequently at 95 °C, each for 60 min, the coated wafer was
exposed to UV light (Autoflood 1000, Optical Associates, Milpitas, CA)
through a negative transparent photomask that contained the desired
channel design. Following a 3.5 min post-exposure bake at 95 °C, the
wafer was developed in SU-8 developer (MicroChem Corp., Newton,
MA) for 60 min to obtain the pattern.


Flexdym channel fabrication 

Flexdym (Blackholelab Inc., Paris) is a thermoplastic elastomer (Young’s
modulus of 1.18 MPa) with a rapid and easy moulding process for
microfluidic devices. After fabrication of the silicon wafer mould
containing the channel designs, a sheet of Flexdym (6 cm × 4 cm) was
placed directly above the mould with another sheet of unpatterned PDMS
(about 1 mm thick) placed above the Flexdym for protection. The whole set
was then placed on a heat press between two Teflon sheets. The plate on
the heat press was heated to 175 °C before starting to mould the Flexdym.
Once the target temperature was reached, the lever on the heat plate was
locked down with a timer set for 5 min. After the process was finished,
the lever was released and the Flexdym sheet was inspected visually to
make sure that no bubbles were trapped around the channel. The chip
was allowed to cool down for 5 min before unfolding the layers. The
Flexdym was permanently sealed with a glass slide by following the same
sealing procedure used for the PDMS channels. The dimensions of the
cross-section of the channels were 201 µm (width) × 166 µm (height),
and the diameter of the obstacles was 99 µm.


SU-8 photoresist channel fabrication 

To make microfluidic channels directly from SU-8 photoresist, an inverse 
mask was designed and printed on transparency. The desired channel 
was printed on the inverse mask in black with transparent dots mark- 
ing the obstacles, and the rest of the mask was left transparent. The 
same procedure used to make the silicon wafer master as described in 
Methods section ‘PDMS channel fabrication’ was followed to fabricate 
the channels on glass slides. The chip was then sealed by 3M VHB tape 
to another glass slide with holes for connections. The dimensions of the
cross-section of the channels were 209 µm (width) × 196 µm (height),
and the diameter of the obstacles was 90 µm. The Young’s modulus of
SU-8 photoresist is 2 GPa (from the table of properties for SU-8
permanent photoresists, MicroChem Corp., Newton, MA, available at
http://microchem.com/pdf/SU-8-table-of-properties.pdf).


Flow rate measurement 

Experimental measurements in Figs. 2 and 4 were made with the system
shown in Fig. 4a. When measuring the relation between pressure
and flow rate, the linking channel valve was closed to allow separate
measurement of the channel with and the channel without obstacles.
Deionized (DI) water was pumped through each channel and a pressure
scan from 0 to 100 kPa was performed using an Elveflow OB1 pressure
controller. The flow rate was measured by an Elveflow MFS5 flow sensor
(0.2–5 ml min⁻¹). To verify Braess’s paradox, the same instruments
were used and the pressure was set constant while recording the flow
rate at each outlet. Red (3 g l⁻¹, FD&C Red #40, Flavors & Colours) and
blue (1.5 g l⁻¹, FD&C Blue #1, Flavors & Colours) dyes were added into DI
water to demonstrate the switching behaviour. The concentrations of
the dyes were adjusted for similar flow rate under the same pressure.
The flow rate measurements in Supplementary Fig. 10 were performed
using isolated channels constructed from Flexdym and SU-8 photoresist,
respectively.


Fluorescence imaging 

Fluorescent polyethylene microspheres (10–20 µm) were suspended
in Tween 80 solution (Cospheric LLC, Santa Barbara, CA) and pumped
through a single microfluidic channel with obstacles by an Elveflow OB1
pressure controller. Two different pressures were applied, 3 kPa and
100 kPa, to demonstrate different flow profiles around the obstacles.
Fluorescence images were captured with an Olympus BX51 microscope
equipped with a NIBA filter through an Infinity 3 CCD camera.


Measured flow rate data and statistics 

Savitzky–Golay filtering was applied to all flow rate data collected
through experiments, using a window length of 11 data points and a
second-order polynomial. For each of the fixed pressures presented
in Fig. 4b–d, a 60 s time series of flow rate data was collected at each
of the outlets with a sampling rate of 10 Hz. Over the 60 s interval, the
linking channel valve was sequentially opened/closed every 15 s. For
each time series, the 15 s intervals in which the valve was open (closed)
were averaged to create a single 15 s time series for each outlet. The
total flow rate (Q_4 + Q_5) was calculated when the valve is open and
closed, respectively, by summing the 15 s time series for the two outlets
point-by-point. The statistics presented in Fig. 4 are the average and
standard deviation of the resulting series. For Supplementary Fig. 12,
the flow rate at each of the two outlets was measured experimentally at a
sampling rate of 100 Hz over a 180 s interval, during which the linking
channel was sequentially opened/closed every 30 s. The total flow rate in
Supplementary Fig. 12c was calculated by summing, point-by-point, the
data in Supplementary Fig. 12a, b.
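
The processing steps for the Fig. 4 statistics can be sketched as follows. This is an illustrative reconstruction on synthetic data, not the authors' code. It assumes that each record starts with the valve open, leaves the first and last five samples unfiltered (a library routine such as SciPy's `savgol_filter` treats the edges differently), and uses the population standard deviation, since the text does not specify which estimator was used.

```python
from statistics import mean, pstdev


def savgol_quadratic(series, window=11):
    """Quadratic Savitzky-Golay smoothing of interior points; edges unchanged."""
    m = window // 2
    norm = (2 * m + 3) * (2 * m + 1) * (2 * m - 1)
    # Closed-form convolution weights for a quadratic fit over 2m + 1 points.
    coeffs = [3 * (3 * m * m + 3 * m - 1 - 5 * i * i) / norm
              for i in range(-m, m + 1)]
    out = list(series)
    for c in range(m, len(series) - m):
        out[c] = sum(w * series[c - m + j] for j, w in enumerate(coeffs))
    return out


def fold_by_valve_state(series, samples_per_interval):
    """Average the open intervals, and the closed intervals, point by point."""
    chunks = [series[i:i + samples_per_interval]
              for i in range(0, len(series), samples_per_interval)]

    def pointwise_mean(group):
        return [mean(values) for values in zip(*group)]

    # Assumption: intervals alternate open, closed, open, closed, ...
    return pointwise_mean(chunks[0::2]), pointwise_mean(chunks[1::2])


def total_flow_statistics(outlet_a, outlet_b, samples_per_interval=150):
    """Mean and standard deviation of the summed outlet flows per valve state."""
    open_a, closed_a = fold_by_valve_state(savgol_quadratic(outlet_a),
                                           samples_per_interval)
    open_b, closed_b = fold_by_valve_state(savgol_quadratic(outlet_b),
                                           samples_per_interval)
    total_open = [x + y for x, y in zip(open_a, open_b)]
    total_closed = [x + y for x, y in zip(closed_a, closed_b)]
    return {"open": (mean(total_open), pstdev(total_open)),
            "closed": (mean(total_closed), pstdev(total_closed))}


def synthetic_outlet(open_level, closed_level, n=150, repeats=2):
    """60 s at 10 Hz with the valve toggled every 15 s (150 samples each)."""
    return ([open_level] * n + [closed_level] * n) * repeats


stats = total_flow_statistics(synthetic_outlet(1.0, 1.2),
                              synthetic_outlet(2.0, 2.3))
print(stats)
```

The synthetic levels are arbitrary; with real records the same functions would be applied to the two measured outlet series at each fixed driving pressure.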


Parameters in simulations and experiments 

In the simulations, we set ρ = 10³ kg m⁻³, μ = 10⁻³ Pa s,
ν = μ/ρ = 10⁻⁶ m² s⁻¹, w = 500 µm for the width of all channels, and
r = 100 µm for the radius of all obstacles, unless otherwise noted. In
all experiments, DI water was used as the working fluid. The other
undeclared dimensions were as follows. In Fig. 2a, b, the length of the
(partially shown) channel was 1.25 cm. In Fig. 2c–e, the channel length
was 4.3 cm, and in Fig. 2f the channel length was 2.0 cm (see Methods
section ‘PDMS channel fabrication’ for the remaining dimensions). In
Fig. 3, L_1 = 0.17 cm, L_2 = 1.0 cm, L_3 = 0.1 cm, L_4 = 1.25 cm and
L_5 = 1.0 cm. In Fig. 4, L_1 = 0.6 cm, L_2 = 2.9 cm, L_4 = 1.4 cm and
L_5 = 1.4 cm. For the linking channel, the switch valve was connected
to the two parallel channels through 15 cm of round tubing and 0.7 cm of
microchannel on each side. Each inlet was connected to the pressurized
vials through 62 cm of tubing, and each outlet was attached to 50 cm of
tubing. The inner diameter of all tubing was 0.79 mm.


Data availability 


The datasets generated and/or analysed during the current study are 
available from the corresponding author on reasonable request. 


Code availability 


Custom Python code is available from the corresponding author on 
request. 
Superconductivity can occur under conditions approaching broken-symmetry
parent states. In bilayer graphene, the twisting of one layer with
respect to the other at ‘magic’ twist angles of around 1 degree leads to
the emergence of ultra-flat moiré superlattice minibands. Such bands are
a rich and highly tunable source of strong-correlation physics, notably
superconductivity, which emerges close to interaction-induced insulating
states. Here we report the fabrication of magic-angle twisted bilayer
graphene devices with highly uniform twist angles. The reduction in
twist-angle disorder reveals the presence of insulating states at all
integer occupancies of the fourfold spin-valley degenerate flat
conduction and valence bands, that is, at moiré band filling factors
ν = 0, ±1, ±2, ±3. At ν = −2, superconductivity is observed below
critical temperatures of up to 3 kelvin. We also observe three new
superconducting domes at much lower temperatures, close to the ν = 0 and
ν = ±1 insulating states. Notably, at ν = ±1 we find states with non-zero
Chern numbers. For ν ≈ −1 the insulating state exhibits a sharp
hysteretic resistance enhancement when a perpendicular magnetic field
greater than 3.6 tesla is applied, which is consistent with a
field-driven phase transition. Our study shows that broken-symmetry
states, interaction-driven insulators, orbital magnets, states with
non-zero Chern numbers and superconducting domes occur frequently across
a wide range of moiré flat band fillings, including close to charge
neutrality. This study provides a more detailed view of the phenomenology
of magic-angle twisted bilayer graphene, adding to our evolving
understanding of its emergent properties.


Interactions dominate over single-particle physics in flat-band electronic
systems, and can give rise to insulating states at partial band fillings,
superconductivity and magnetism. Recently, correlated insulating
phases and strongly coupled superconducting domes have been found
in ultra-flat bands of magic-angle twisted bilayer graphene (MAG) close
to half-filling (ν = ±2), establishing graphene as a platform for the
investigation of strongly correlated two-dimensional electrons. MAG
has several advantages that should enable new insights into these
systems: the correlations can be accurately controlled by varying the twist
angle between the two graphene layers; techniques for the fabrication
of ultra-clean graphene layers are well established; and the electron
density (n_0 = A_0⁻¹ ≈ 10¹² cm⁻², where A_0 is the area of the moiré unit cell)
that is required to fill a moiré superlattice band can be adequately
supplied by electrical gates.

Here we report the observation of correlated states at all integer
fillings of ν = n/n_0 (where n is the gate-modulated carrier density),
including at charge neutrality, and the occurrence of new
superconducting domes and orbital magnetic states in MAG. When
interactions are neglected, the two low-energy moiré bands of MAG have
fourfold spin–valley flavour degeneracies, which implies that the
density measured from the carrier neutrality point (CNP) is 4n_0 when
the flat conduction band is full and −4n_0 when the valence band is
empty. Interactions can lift the flavour degeneracies and give rise
to completely empty or full spin–valley polarized flat bands, with
interaction-induced gaps at all integer values of ν, in place of the
symmetry-protected Dirac points that connect the conduction and
valence bands for each flavour. The many-body physics of these bands
is highly sensitive to the twist angle θ and the interaction strength ε⁻¹
(where ε is the effective dielectric constant in MAG). In some cases,
depending on the details of the electronic structure, bands can have
non-zero Chern numbers, allowing for the possibility of orbital
magnetism and anomalous Hall effects.


¹ICFO - Institut de Ciencies Fotoniques, The Barcelona Institute of Science and Technology, Castelldefels, Barcelona, Spain. ²Department of Physics, University of Texas at Austin, Austin, TX,
USA. ³National Institute for Materials Science, Tsukuba, Japan. ⁴Beijing National Laboratory for Condensed Matter Physics and Institute of Physics, Chinese Academy of Sciences, Beijing, China.
Fig. 1 | Integer-filling correlated states and new superconducting domes.
a, Schematic of a typical MAG device. b, Atomic force microscopy image and
schematic of how various measurements are obtained. Scale bar, 2 μm.
c, Four-terminal longitudinal resistance plotted against carrier density at
different perpendicular magnetic fields from 0 T (black trace) to 480 mT (red
trace). d, Colour plot of longitudinal resistance against carrier density and
temperature, showing different phases including metal, band insulator (BI),
correlated state (CS) and superconducting state (SC). The boundaries of the
superconducting domes, indicated by yellow lines, are defined by 50%
resistance values relative to the normal state. Note that the transition from the
metal to the superconducting state is not sharp at some carrier densities, which
adds uncertainty to the value of T_c extracted. e, Longitudinal resistance at
optimal doping of the superconducting domes as a function of temperature.
The resistance is normalized to its value at 8 K. Note that data points for
n = −7.5 × 10¹¹ cm⁻² are overlaid by the data points for n = 5 × 10¹¹ cm⁻², as both
curves follow a very similar line. f, Conductance G_xx plotted against inverse
temperature at carrier densities corresponding to ν = 0, 1, ±2 and 3. The straight
lines are fits to G_xx ∝ exp(−Δ/2kT) (where Δ is the size of the correlation-induced
gap and k is the Boltzmann constant), for temperature-activated behaviour, and
give gap values of 0.35 meV (ν = −2), 0.14 meV (ν = 1), 0.37 meV (ν = 2), 0.27 meV
(ν = 3) and 0.86 meV (ν = 0; CNP). g, Mean-field phase diagram for neutral ν = 0
(CNP) twisted bilayer graphene, as a function of twist angle and interaction
strength, showing different configurations of C_2 symmetry and Chern number
(C). Red and blue regions with solid outlines indicate states that do not break
symmetry, and therefore have bands with no Berry curvature and vanishing
Chern number. Blue indicates a gapped state and red indicates a gapless state.
Zones filled with other colours indicate gapped states that break C_2T symmetry
and have bands with different Chern numbers, as shown.

Figure 1a is a schematic of a typical graphite-gated, hexagonal boron
nitride (hBN)-encapsulated MAG heterostructure device. The atomic
force microscopy image in Fig. 1b shows the high structural
homogeneity of the device. Figure 1c shows four-terminal resistance R_xx as a
function of n at different out-of-plane magnetic fields B_⊥, measured
at a temperature T of 16 mK. We find strong resistance peaks at n = ±4n_0
≈ ±3 × 10¹² cm⁻² that mark the edges of the flat bands, consistent with
previous studies. The full-band density corresponds to an average
twist angle across the device of about 1.10°. By comparing n_0 values
extracted from two-terminal measurements between different contact
pairs (Extended Data Fig. 4), we estimate that the variation in twist angle
(Δθ) is only around 0.02° over a span of about 10 μm. Such homogeneity
in the twist angle is, to our knowledge, unprecedented in a MAG device.

In addition to the resistance peaks at the CNP and at ν = ±4, we also
observe interaction-induced resistance peaks at all non-zero integer
fillings of the moiré bands (ν = ±1, ±2, ±3), corresponding to 1, 2 and
3 electrons (+) or holes (−) per moiré unit cell (Fig. 1c). Signatures of
some of these resistive states have been observed previously,
but they are much more strongly developed here. From
temperature-dependent transport behaviour over a range of 10 K (Fig. 1f), it is possible
to extract the activated gap size of the correlated insulator states. We
obtain values of 0.34 meV (ν = −2), 0.37 meV (ν = 2) and 0.25 meV (ν = 3).
Evidence for thermally activated transport is much weaker for the
ν = 1 state (0.14 meV) and is entirely absent for the ν = −3 and ν = −1 states,
which might indicate that these are correlated semi-metallic states
rather than insulating states.
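
The gap extraction described above can be sketched as a straight-line fit of ln(G_xx) against 1/T, assuming the activated form G_xx ∝ exp(−Δ/2kT) quoted in the Fig. 1f caption. The function name `activation_gap_meV` and the synthetic data are illustrative assumptions, not the analysis code used for this study.

```python
import numpy as np

K_B_MEV_PER_K = 8.617333262e-2  # Boltzmann constant in meV per kelvin

def activation_gap_meV(temperature_K, conductance):
    """Estimate the gap from a straight-line fit of ln(G) against 1/T.

    Assumes G = G0 * exp(-gap / (2 k T)), so the slope equals -gap / (2 k).
    """
    inv_T = 1.0 / np.asarray(temperature_K, dtype=float)
    ln_G = np.log(np.asarray(conductance, dtype=float))
    slope, _intercept = np.polyfit(inv_T, ln_G, 1)
    return -2.0 * K_B_MEV_PER_K * slope

# Synthetic (not measured) conductance generated with a 0.86 meV gap.
T = np.linspace(5.0, 30.0, 12)
G = 1e-4 * np.exp(-0.86 / (2.0 * K_B_MEV_PER_K * T))
gap = activation_gap_meV(T, G)
```

On real data the fit window matters, because activated behaviour holds only over a limited temperature range.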

Our device also shows clear temperature-activated transport
behaviour below 33 K at the CNP, with an extracted gap size of 0.86 meV. Gaps
at the CNP do not require broken flavour symmetries, but they do require
that at least one of the emergent C_2 and C_2T symmetries, which
prevent CNP bands from touching, be broken. These symmetries can be
explicitly broken by crystallographic alignment of the MAG and hBN
layers; however, careful inspection of the angle between these (see
Supplementary Information) allows us to rule out this scenario. As we also
do not observe any other signatures of hBN alignment, such as satellite
resistance peaks, we conclude that the gap at CNP probably originates
as a result of interactions.

Fig. 2 | The superconducting dome at fillings between ν = 0 and ν = 1.
a, Differential resistance plotted against d.c. bias current at various
temperatures from 60 mK (black trace) to 160 mK (red trace). The blue dashed
line is a fit to the V_xx ∝ I³ power law, and identifies a BKT transition at a
temperature (T_BKT) of around 110 mK. b, Longitudinal resistance plotted against
temperature at various out-of-plane magnetic fields, showing that normal levels
of resistance are restored at magnetic fields greater than 300 mT.
c, Two-dimensional colour plot of the differential resistance as a function of magnetic
field and excitation current at 16 mK. The orange traces show differential
resistance plotted against current at magnetic field values of 225 mT, 150 mT,
75 mT and 0 T (top to bottom). d, Values of the critical magnetic field at various
temperatures. The straight line is a fit to the Ginzburg–Landau expression. For
all measurements in a–d, the carrier density was fixed at optimal doping of the
dome, n = 5 × 10¹¹ cm⁻².

The existence of a non-trivial gap at the CNP has strong implications
for the properties of other gapped MAG states. Mean-field theory (Fig. 1g,
Supplementary Information) predicts gapped states at neutrality over
a wide range of twist angles and interaction strengths. Gapped states at
non-zero integer values of ν are expected only when the moiré
superlattice band width is smaller than the exchange shift produced by band
occupation, and this occurs only near the magic angle. Overall, our
calculations demonstrate that insulating states (or, for weak interactions,
semi-metallic states) are common at all integer values of ν, as observed
experimentally. This mean-field phase diagram does not allow for
broken translational symmetry, which appears not to be required for our
experiments. If broken translational symmetry did have a key role in
establishing insulating states, they would be expected at moiré band
fillings ν = n + p/3 where p and n are integers; this is not consistent with
our experimental observations.

Notably, in four distinct carrier-density intervals between integer
filling factors, we observe sharp decreases in the resistance of the device
with decreasing temperature (Fig. 1e), which can be restored by the
application of a small perpendicular magnetic field B_⊥ < 500 mT (Fig. 2b).
Figure 1d shows a colour plot of resistance against temperature and
carrier density, in which four dome-shaped pockets of low resistance
flank the most resistive states. In three of these domes, the resistance
decreases to zero (Fig. 1e, Extended Data Figs. 5, 7), which is
consistent with superconductivity. In the fourth region (at n = 5 × 10¹¹ cm⁻²)
the resistance remains slightly greater than zero, owing to insufficient
cooling of electrons below 100 mK in the cryostat.

In the dome close to −2n_0, we observe a superconducting transition.
Although this has been reported previously, the superconducting
transition temperature T_c (defined as the temperature at which the resistance
falls to half its normal-state value) of 3 K that we observe here is
considerably higher than the previous value of around 1.7 K. The other
superconducting domes, which to our knowledge are all observed here for the
first time, have much lower T_c values and much sharper transitions. We
identify a superconducting dome between n_0 and 2n_0 with T_c = 650 mK,
and two domes between the CNP and ±n_0 with T_c = 160 mK and
T_c = 140 mK, respectively. As can be seen in Fig. 1d
and Extended Data Fig. 6, it is likely that additional superconducting
domes are developing between other filling factors; however, these are
not fully developed, and are presumably obscured by the inhomogeneity
that remains in our improved samples.

Figure 2 shows the signatures of the newly observed superconducting
domes (exemplified by the state between CNP and n_0; all other states are
described in detail in Extended Data Fig. 7 and Extended Data Table 1).
In Fig. 2a, the differential resistance dV_xx/dI is plotted against the d.c.
bias current I at various temperatures. At 60 mK, the traces display the
nonlinear resistance typical of two-dimensional superconductivity,
with a sharp resistive transition for I > I_c ≈ 3 nA (where I_c is the critical
supercurrent). The blue dashed line is a power-law fit to dV_xx/dI ∝ I²,
consistent with two-dimensional superconductivity described by the
Berezinskii–Kosterlitz–Thouless (BKT) theory, showing a transition
temperature T_BKT of around 110 mK.

The temperature dependence of the resistance R_xx at various magnetic
fields is illustrated in Fig. 2b. The superconductivity signal is gradually
weakened upon increasing the applied field, and R_xx varies almost linearly
with temperature above a critical field B_c = 300 mT. The suppression
of superconductivity by the magnetic field is further exemplified by
Fig. 2c, which shows a plot of the differential resistance as a function
of the magnetic field and the excitation current I at 16 mK. The critical
supercurrent I_c is reduced by application of a magnetic field,
reaching zero when B_⊥ > 300 mT. From these measurements, we extract the
temperature-dependent critical magnetic field B_c (defined by 50% of
the normal state R_xx value). By fitting to the expression from
Ginzburg–Landau theory, B_c = [Φ_0/(2πξ²)](1 − T/T_c), we extract a coherence length
ξ_GL(T = 0 K) of around 32 nm. Here Φ_0 = h/(2e) is the superconducting flux
quantum, h is Planck’s constant and e is the electron charge.
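
As a rough illustration of the Ginzburg–Landau fit above, the sketch below fits a straight line to B_c(T) and converts the zero-temperature intercept into a coherence length through B_c(0) = Φ_0/(2πξ²). The function name and the synthetic data points are assumptions made for illustration; they are not the measured values.

```python
import math
import numpy as np

H = 6.62607015e-34        # Planck constant (J s)
E = 1.602176634e-19       # elementary charge (C)
PHI_0 = H / (2.0 * E)     # superconducting flux quantum (Wb)

def coherence_length_nm(temperature_K, critical_field_T):
    """Fit B_c(T) = B_c0 * (1 - T/T_c); return (xi_GL in nm, T_c in K)."""
    slope, b_c0 = np.polyfit(temperature_K, critical_field_T, 1)
    t_c = -b_c0 / slope                      # temperature at which B_c reaches zero
    xi_m = math.sqrt(PHI_0 / (2.0 * math.pi * b_c0))
    return xi_m * 1e9, t_c

# Synthetic points on a line with B_c(0) = 0.3 T and T_c = 0.16 K (illustrative only).
T = np.array([0.02, 0.04, 0.06, 0.08, 0.10])
B = 0.3 * (1.0 - T / 0.16)
xi_nm, t_c = coherence_length_nm(T, B)
```

With a zero-temperature critical field of 300 mT, this gives a coherence length of about 33 nm, close to the value of around 32 nm quoted above.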

We have studied the response of the flat bands to an applied
magnetic field at a temperature of 100 mK. Figure 3a shows a colour map
of the resistance as a function of carrier density and magnetic field,
and the corresponding schematic highlights the trajectories of the
resistance maxima. We find sets of Landau fans that originate from
the CNP and from most of the resistive states with an integer filling
factor. In previous studies, Landau levels were identified only on the
high-carrier-density sides of insulating states. Here, we also observe
Landau levels dispersing to lower densities. The vanishing carrier
densities near most integer filling factors, as evidenced by both Landau
fans and weak field Hall resistivities (Extended Data Fig. 3), suggest
that the fourfold spin–valley band degeneracy of the non-interacting
state is lifted over a large range of filling factors, resetting the carrier
density per band.

Our observations suggest that a rich variety of spin–valley
broken-symmetry states occur as a function of carrier density and magnetic
field. Landau levels that can be traced to the CNP exhibit fourfold
degeneracy with a filling-factor sequence of ν_L = ±4, ±8, ±12,..., as well as
spin–valley broken-symmetry states with ν_L = ±2. The Landau levels that fan
out from ν = 2(−2) follow a sequence of ν_L = 2(−2), 4(−4), 6(−6),... at low
magnetic field, indicating partially lifted degeneracy for either spin or
valley. Quantum oscillations from ν = −2 exhibit a dominant degeneracy
sequence of ν_L = −3, −5, −7,... at high magnetic field. Near ν = −3, quantum
oscillations exhibit fully lifted degeneracy of Landau levels with filling
factors ν_L = −1, −2, −3, −4,.... The Landau fans that emerge from insulating
states all extrapolate to a carrier density that vanishes at integer moiré
band filling factors.

We also find that the degeneracies of Landau levels originating from
the CNP and ν = −2 change when crossing the ν = −1 and ν = −3 states,
suggestive of first-order phase transitions that change band
degeneracies. In particular, as is shown in Fig. 3b, Landau levels from the
ν = −2 and ν = −3 states display a criss-crossing pattern, superficially
similar to that of a Hofstadter butterfly, but distinct in that the Landau
level indices that can be traced to the ν = −3 state are spaced by one
filling, whereas those that can be traced to the ν = −2 state are spaced
by two fillings.

Fig. 3 | Shubnikov–de Haas oscillations in the MAG flat bands. a, Top, colour
map of longitudinal resistance plotted against carrier density and magnetic
field. Bottom, the corresponding schematic that identifies visible Landau level
fans with a dominant degeneracy. The Landau fan diagram diverging from the
CNP (ν = 0) follows a fourfold degenerate sequence with ν_L = ±4, ±8, ±12,..., with
symmetry-broken states at ν_L = ±2. The fans from ν = 2(−2) follow a twofold
degenerate ν_L = −2(2), −4(4), −6(6),... sequence, with broken-symmetry states at
ν_L = −3, −5, −7,.... The ν = −3 fan follows a single degenerate ν_L = −1, −2, −3, −4,...
sequence. Emergent correlated phases at all integer moiré fillings, including the
CNP (ν = 0), are highlighted in dark red. Chern insulating states are highlighted in
orange. b, Magnification of a around the ν = −3 state, showing signatures similar
to a Hofstadter butterfly spectrum with criss-crossing Landau levels fanning out
from ν = −3 and ν = −2 filling states.


It is noteworthy that neither of the ν = ±1 correlated states shows clear
formation of Landau levels. The positions of their resistance maxima
do, however, exhibit clear dependencies on the magnetic field and the
carrier density. At ν = −1 the resistance state has no slope (dn/dB) at low
field; however, above a critical field B_c = 3.6 T, we observe the sudden
development of a slope that is consistent with a Chern number of 1.
Furthermore, at ν = 1 the position of the resistance peak shifts to lower
carrier density, with a slope that is consistent with a Chern number of
2. The slope of dn/dB in the absence of a Landau-level fan in the ν = ±1
correlated states is consistent with Chern insulating states from spin
and valley symmetry breaking at odd values of ν. As discussed earlier
and as predicted by mean-field theory, valley-projected bands in
insulating states can have non-zero Chern numbers that compete closely with
states with zero Chern numbers. Although we cannot resolve quantized
values in R_xy nor zero resistance in R_xx, as expected for a Chern
insulating state, we do not do so for the other Landau levels in Fig. 3a either.
We therefore conclude that our devices are still too inhomogeneous to
observe quantization over the entire device.

Fig. 4 | Field-driven phase transition near the ν = −1 state. a, Longitudinal
resistance plotted as a function of carrier density and out-of-plane magnetic
field measured at 16 mK. The orange traces show longitudinal resistance plotted
against carrier density with the magnetic field (from top to bottom) fixed at 4 T,
3.8 T, 3.6 T and 3.4 T. b, Longitudinal resistance plotted against temperature at
various magnetic fields. c, Longitudinal resistance plotted against magnetic
field at various temperatures, with dashed and solid lines corresponding to
increasing and decreasing magnetic field, respectively. d, Dependence of the
critical magnetic field (extracted from up sweeps) and the hysteresis value on
temperature. Note that the transition above 800 mK is not sharp, adding
uncertainty to the extracted critical field values. In b–d, the carrier density is
fixed at −8.43 × 10¹¹ cm⁻². e, Dependence of the critical magnetic field and the
hysteresis value on the carrier density at 100 mK.
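
The link between the slope dn/dB of a resistance maximum and a Chern number can be made concrete with the Streda relation, dn/dB = C e/h. This relation is a standard result that is not written out in the text, so the sketch below is an assumption about how the slopes would be converted; the function names are hypothetical.

```python
H = 6.62607015e-34    # Planck constant (J s)
E = 1.602176634e-19   # elementary charge (C)

def chern_number_from_slope(dn_dB_cm2_per_T):
    """Return C = (h/e) * dn/dB, with dn/dB given in cm^-2 per tesla."""
    dn_dB_m2_per_T = dn_dB_cm2_per_T * 1e4   # convert cm^-2 to m^-2
    return dn_dB_m2_per_T * H / E

def slope_from_chern_number(chern):
    """Return the expected dn/dB in cm^-2 per tesla for a given Chern number."""
    return chern * E / H * 1e-4

slope_c1 = slope_from_chern_number(1)                 # about 2.4e10 cm^-2 per tesla
chern_check = chern_number_from_slope(2 * slope_c1)   # recovers 2
```

On this scale, a Chern number of 1 corresponds to a density shift of roughly 2.4 × 10¹⁰ cm⁻² per tesla.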

Exactly at the transition at which the slope of the ν = −1 resistive state
in Fig. 3a changes from dn/dB = 0 to a dn/dB consistent with Chern
number 1, we find a strong hysteretic increase of R_xx and R_xy; this is
indicative of a possible magnetic-field-induced first-order phase transition.
Figure 4a displays a plot of R_xx as a function of n and B_⊥. Figure 4b displays
the temperature-dependent resistance R_xx(T) near ν = −1 (or n = −8.43
× 10¹¹ cm⁻²) for a series of magnetic field values. Whereas at B_⊥ = 0 T,
R_xx(T) shows a typical metal–superconductor phase transition, for
B_⊥ > 3.6 T and T < 0.9 K, R_xx(T) has a sharp jump and an insulating
temperature dependence.

Figure 4c shows plots of R_xx against the magnetic field at ν = −1 for
up and down sweeps of the magnetic field. Below 800 mK, the curves
show sharp jumps in resistance at associated critical transition fields
B_c, and demonstrate strong hysteretic behaviour that is dependent on
the sweeping direction of the magnetic field; the width of the magnetic
field of the hysteresis loop is denoted by ΔB_c. The critical field B_c is always
higher for up sweeps than for down sweeps.

Both B_c and ΔB_c are highly temperature-dependent, with B_c shifting to
higher values and ΔB_c becoming smaller as the temperature increases. At
T > 800 mK, the hysteresis almost disappears and the transition becomes
broader. The temperature dependencies of B_c and ΔB_c were extracted
and are shown in Fig. 4d. The phase transition and hysteresis occur
over a narrow range of carrier densities from around −8.3 × 10¹¹ cm⁻² to
−9 × 10¹¹ cm⁻² (Extended Data Fig. 8b, c) with B_c and ΔB_c at different carrier
densities shown in Fig. 4e. Overall, we observe similar behaviour in Hall
resistance measurements (Extended Data Fig. 8d). These observations
indicate that the origin of the change in the slope dn/dB of the
resistance maximum is a first-order phase transition, and is probably due to
a competition between correlated states with zero and non-zero Chern
numbers at high magnetic fields, suggesting the emergence of a
field-stabilized orbital magnetic state.

Notably, we have observed superconducting domes close to charge
neutrality. To our knowledge, these states represent the lowest carrier
density (n = 3 × 10¹¹ cm⁻²; counting from CNP) at which superconductivity
has been observed. The existence of superconducting domes across a
wide range of moiré band fillings must have important implications for
our understanding of their origin. Because the density of states
diminishes close to the CNP, the appearance of superconductivity seems not to
be simply related to a high density of states of the non-interacting bands.
Superconductivity occurs adjacent to insulating states that seem, on
the basis of Landau fan patterns, to break spin–valley degeneracy, and
adjacent to insulating states that do not. Nevertheless, its consistent
association with nearby correlated insulator states suggests an exotic
pairing mechanism. Conversely, at this point our observation cannot
rule out the possibility of conventional electron–phonon coupling
superconductivity in metallic states with quasiparticles that evolve
adiabatically from those of the non-interacting system and compete with
a rich variety of distinct insulating states from which they are separated
by first-order phase transition lines. In this case, it is possible that
the consistent high density of states over a broad range of filling factors
helps to support superconductivity in the metallic state.
Methods 


Device fabrication 

Extended Data Fig. 1 displays a step-by-step stacking process for the
fabrication of twisted bilayer graphene (tBLG) with a graphite bottom
gate. The hBN/tBLG/hBN/graphite stacks were exfoliated and
assembled using a van der Waals assembly technique. Monolayer graphene,
thin graphite and hBN flakes (around 10 nm thick) were first
exfoliated on SiO₂ (around 300 nm)/Si substrate, followed by the ‘tear and
stack’ technique with a polycarbonate (PC)/polydimethylsiloxane
(PDMS) stamp to obtain the final hBN/tBLG/hBN/graphite stack. The
separated graphene pieces were rotated manually by a twist angle of
around 1.2–1.3°. We purposefully chose a larger twist angle during the
assembly of the heterostructure owing to the high risk of relaxation
of the twist angle to random lower values. To increase the structural
homogeneity, we further carried out a mechanical cleaning process to
squeeze out the trapped blisters and release the local strain (Extended
Data Fig. 2). To avoid the uncertainty induced by thermal expansion of
the transfer stage, all stacking steps were carried out at a fixed
temperature of 100 °C, except that the final stacks were released at 180 °C
(the melting point of polycarbonate). We did not perform subsequent
high-temperature annealing, to avoid relaxation of the twist angle. We
further patterned the stacks with PMMA resist and CHF₃ + O₂ plasma and
exposed the edges of graphene, which was subsequently contacted by
Cr/Au (5/50 nm) metal leads using electron-beam evaporation (Cr) and
thermal evaporation (Au).


Measurement 

Transport measurements were carried out in a dilution refrigerator with
a base temperature of 16 mK and a perpendicular magnetic field of up
to 5 T. The dilution refrigerator was well filtered to avoid heating of the
electrons in our devices. We use superconducting-type coaxial cables
(around 2 m long; Lakeshore) from the room-temperature plate to the
mixing chamber plate of the cryostat. We add on each line a pi filter (RS
239-191) at room temperature, and a powder filter (Leiden Cryogenics)
as well as a two-stage resistor–capacitor filter on a printed circuit board
(R = 1 kΩ, C = 100 nF) at the mixing chamber plate. The total resistance
of each line is about 2 kΩ. The sample is located in a copper box with
coaxial feedthroughs.

Standard low-frequency lock-in techniques were used to measure
the resistance R_xx and R_xy with an excitation current of about 1 nA at a
frequency of 19.111 Hz. In the measurement of differential resistance
dV/dI, an a.c. excitation current (around 0.5 nA) was applied through
an a.c. signal (0.5 V) generated by the lock-in amplifier in combination
with a 1/100 divider and a 10-MΩ resistor. Before combining with the
excitation, the applied d.c. signal passed through a 1/100 divider and
a 1-MΩ resistor. The induced differential voltage was then measured
at the same frequency of 19.111 Hz with the standard lock-in technique. For
measurements in strong magnetic fields we found that the increased
contact resistance made it difficult to obtain accurate values of the
device resistance. To resolve this issue, we applied a global gate
voltage (+20 V) through Si/SiO₂ (around 300 nm) to tune the charge carrier
density separately in the device leads.
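
The quoted a.c. excitation level follows from the source voltage, the divider ratio and the series resistor. The short sketch below checks the arithmetic, assuming the sample resistance is negligible next to the series resistor; the function name is hypothetical.

```python
def excitation_current_A(source_V, divider_ratio, series_resistance_ohm):
    """Current set by a divided-down voltage across a large series resistor.

    Assumes the sample resistance is negligible compared with the series resistor.
    """
    return source_V * divider_ratio / series_resistance_ohm

# a.c. line: 0.5 V source, 1/100 divider, 10 MOhm resistor
i_ac = excitation_current_A(0.5, 1.0 / 100.0, 10e6)

# d.c. line: 1/100 divider and 1 MOhm resistor, evaluated per volt at the source
i_dc_per_volt = excitation_current_A(1.0, 1.0 / 100.0, 1e6)
```

This reproduces the a.c. excitation of around 0.5 nA and shows that each volt on the d.c. source corresponds to about 10 nA of bias current under the same assumption.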


Twist angle extraction 

The total carrier density n tuned by the gate is calibrated by Hall
measurements at low field (Extended Data Fig. 3). Near charge neutrality and
band insulating states, Hall charge carrier density (n_H = −B/(eR_xy)) should
closely follow gate-induced carrier density n; that is, dn_H/dn = 1,
providing accurate measurements of the carrier density n.
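
A minimal sketch of the Hall-density conversion used in this calibration is given below. The function name and the input numbers are illustrative assumptions rather than values from the measurements.

```python
E = 1.602176634e-19  # elementary charge (C)

def hall_density_cm2(B_T, R_xy_ohm):
    """Return n_H = -B / (e R_xy) in cm^-2 (sign convention as in the text)."""
    n_m2 = -B_T / (E * R_xy_ohm)
    return n_m2 * 1e-4   # convert m^-2 to cm^-2

# Illustrative numbers only: B = 0.5 T and R_xy = -312 ohm
n_H = hall_density_cm2(0.5, -312.0)
```

With these inputs the Hall density comes out close to 10¹² cm⁻², which is the density scale of the flat bands.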

For different integer (ν) moiré filling states, the total carrier density
can be described by νn_0 = νA_0⁻¹ = 4ν(1 − cos θ)/(√3 a²), where A_0 is the unit
cell area of the periodic moiré pattern, θ is the twist angle and a = 0.246 nm
is the lattice constant of graphene. The local twist angles between
different contacts are extracted with the carrier densities of ν = 2 states
shown in Extended Data Fig. 4. The carrier density difference between
CNP and ν = 2 states in device D1 ranges from 1.38 × 10¹² cm⁻² to
1.45 × 10¹² cm⁻², corresponding to local twist angles ranging from 1.09°
to 1.12°. For device D2, the local twist angles range from 1.08° to 1.10°.
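
The relation between carrier density and twist angle given above can be checked numerically. The sketch below implements the formula and its inverse; the function names are hypothetical and the code is not the authors' extraction routine.

```python
import math

A_GRAPHENE_CM = 0.246e-7  # graphene lattice constant, 0.246 nm expressed in cm

def density_from_twist_angle(theta_deg, nu=1):
    """Carrier density (cm^-2) at filling nu for twist angle theta."""
    theta = math.radians(theta_deg)
    return 4.0 * nu * (1.0 - math.cos(theta)) / (math.sqrt(3.0) * A_GRAPHENE_CM**2)

def twist_angle_from_density(n_cm2, nu=2):
    """Invert the relation above: twist angle (degrees) from the density at filling nu."""
    cos_theta = 1.0 - math.sqrt(3.0) * A_GRAPHENE_CM**2 * n_cm2 / (4.0 * nu)
    return math.degrees(math.acos(cos_theta))

theta_low = twist_angle_from_density(1.38e12)     # close to 1.09 degrees
theta_high = twist_angle_from_density(1.45e12)    # close to 1.12 degrees
n_full = density_from_twist_angle(1.10, nu=4)     # full-band density at 1.10 degrees
```

The two ν = 2 densities quoted for device D1 map onto twist angles of about 1.09° and 1.12°, and a twist angle of 1.10° gives a full-band density of about 2.8 × 10¹² cm⁻², consistent with the band-edge peaks near ±3 × 10¹² cm⁻².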


Data availability 


The data that support the findings of this study are available from the 
corresponding author upon reasonable request. 
Extended Data Fig. 1 | Schematic of the stacking process for the fabrication of twisted bilayer graphene with graphite bottom gate. a–h, Sequential device
fabrication method, describing the tear-and-stack co-lamination process used to create the hBN/tBLG/hBN/graphite stacks.

Extended Data Fig. 2 | Mechanical cleaning of twisted bilayer graphene. a–d, Optical images of the final stacks before mechanical cleaning (a, c) and after
mechanical cleaning (b, d).
Extended Data Fig. 3 | Hall measurements of device D1. Coloured vertical bars
correspond to filling factors ν = −4, −2, 2 and 4. Hall charge carrier density
(n_H = −B/(eR_xy)) closely follows the gate-induced carrier density n. Near charge
neutrality, n_H = n. Beyond the band insulator regions (ν = ±4), the Hall density
strictly follows n_H = n ± 4n_0.
Extended Data Fig. 4 | Measuring the homogeneity of the twist angle.
a–d, Atomic force microscopy images of a set of twisted bilayer graphene
samples. Scale bar, 2 μm. Dashed-line arrows correspond to the height profiles
shown below the topographies. e, f, Two-terminal conductance measurements
taken between contacts shown in a and b. Colours correspond to the bars shown
in a and b, respectively. The difference in carrier density between the CNP and
the ν = 2 state is used to extract the local twist angle.

Extended Data Fig. 5 | Four-terminal longitudinal resistance as a function of
carrier density at different temperatures. The four-terminal longitudinal
resistance is plotted against carrier density n for different temperatures, from
69 K (black trace) to 16 mK (red trace). Coloured vertical bars correspond to the
filling factors ν as shown.
superconducting domes. a–d, Differential resistance measurements for
additional domes between −4n_0 and −3n_0 (a), −2n_0 and −n_0 (b), 2n_0 and 3n_0 (c) and
3n_0 and 4n_0 (d). e–h, Corresponding thermal activation measurements of
resistance against temperature for the same carrier densities as in a–d,
respectively.
Extended Data Fig. 7 | Full characterization of all four superconducting
pockets in sample D1. a–d, Thermal activation measurements of resistance
against carrier density. The inset shows magnified images, demonstrating that
in three superconducting states the resistance drops completely to zero (a, b, d)
and in one superconducting state the resistance saturates at about 80 Ω (c).
e–h, Differential resistance is plotted against d.c. bias current at various
temperatures in order to establish BKT transition temperatures.
i–l, Two-dimensional colour plots of the differential resistance as a function of
magnetic field and excitation current at 16 mK.
Extended Data Fig. 8 | Additional magnetic hysteresis data. a, Four-terminal
longitudinal resistance as a function of carrier density at different temperatures
from 50 mK (black trace) to 5.2 K (red trace). b, c, Plots of longitudinal resistance
R_xx and transverse resistance R_xy against magnetic field at different charge
carrier densities and 100 mK. Arrows indicate the sweep direction of the
magnetic field. Data from b is used to extract data for Fig. 4e. d, Transverse
resistance plotted against magnetic field at different temperatures (the same
dataset as in Fig. 4c). Dashed and solid lines correspond to ascending and
descending magnetic fields, respectively.


Extended Data Table 1 | Full dataset for all observed superconducting states in device D1


n (cm⁻²)        | T_c (mK) | T_BKT (mK) | B_c (mT) | ξ_GL(T = 0) (nm)
−1.73 × 10¹²    | 3000     | 600        | ~180     | ~41
−7.6 × 10¹¹     | 140      | 75         | ~100     | ~55
5 × 10¹¹        | 160      | 110        | ~300     | ~32
1.44 × 10¹²     | 650      | 580        | ~400     | ~27
The engineering of biological molecules is a key concept in the design of highly
functional, sophisticated soft materials. Biomolecules exhibit a wide range of functions
and structures, including chemical recognition (of enzyme substrates or adhesive
ligands, for instance), exquisite nanostructures (composed of peptides, proteins or
nucleic acids), and unusual mechanical properties (such as silk-like strength,
stiffness, viscoelasticity and resiliency). Here we combine the computational design
of physical (noncovalent) interactions with pathway-dependent, hierarchical ‘click’
covalent assembly to produce hybrid synthetic peptide-based polymers. The
nanometre-scale monomeric units of these polymers are homotetrameric, α-helical
bundles of low-molecular-weight peptides. These bundled monomers, or ‘bundlemers’,
can be designed to provide complete control of the stability, size and spatial display of
chemical functionalities. The protein-like structure of the bundle allows precise
positioning of covalent linkages between the ends of distinct bundlemers, resulting in
polymers with interesting and controllable physical characteristics, such as rigid rods,
semiflexible or kinked chains, and thermally responsive hydrogel networks. Chain
stiffness can be controlled by varying only the linkage. Furthermore, by controlling the
amino acid sequence along the bundlemer periphery, we use specific amino acid side
chains, including non-natural ‘click’ chemistry functionalities, to conjugate moieties
into a desired pattern, enabling the creation of a wide variety of hybrid nanomaterials.


Our bundlemer-based polymer chains exhibit a variety of unique fea- 
tures. Unlike high-molecular-weight synthetic polymers, our chains 
use small (roughly 3 kDa), easily synthesized peptide sequences that 
fold into designed tetrameric 4-nanometre bundles. The subsequent 
covalent assembly of these bundles yields polymers with micrometre-scale
contour lengths. The design of α-helical homo-oligomers has a
long history, with both empirical de novo and computational methods
being used. Here, computationally designed homotetrameric bundles
with D2 symmetry present two reactive groups at each end, owing to
chemical functionalization of the amino termini of the constituent pep- 
tides (Fig. 1a). Distinct homotetrameric bundles with complementary 
reactive functional groups are chemically linked (or ‘clicked’ together) 
to produce bundlemer chains. 

Our first example of a bundlemer chain uses two different coiled-coil 
peptide bundles with complementary reactive groups for covalent
polymerization (Fig. 1a). One peptide (peptide 1; Extended Data Fig. 1) 
contains the non-natural organic group maleimide at the N terminus, 
resulting in two maleimides at each end of the folded bundlemer. The 
second peptide contains the amino acid cysteine at the N terminus
(peptide 2; Extended Data Fig. 1). The maleimide and the thiol group
of cysteine undergo a thiol-Michael addition reaction (Extended Data 
Fig. 2) to form two covalent bonds between neighbouring bundlem- 
ers, producing a hybrid polymer chain that is maintained by both the 
covalent linkages and the complementary, noncovalent (physical) 
interactions within each bundlemer. The resulting polymers exhibit 
an unprecedented rigid-rod character. Both negatively stained transmission
electron microscopy (TEM) and cryogenic TEM (cryoTEM)
data reveal extreme chain stiffness (Fig. 1b–d). Direct imaging of the
rigid rods is possible because the polymer cross-section is defined by 
constituent bundlemers. Small-angle neutron scattering (SANS) of the 
rigid-rod polymers confirms that the rod cross-section is that of a single 
bundlemer with a diameter of roughly 2 nm (Extended Data Fig. 3). 
Although one-dimensional (1D) assemblies of proteins can exhibit 
a high rigidity (as in viruses, actin, microtubulin and misfolded
globular proteins), all are assemblies of much larger proteins, with
many interprotein interactions, much wider cross-sections, and large
ratios of protein molecular mass to linear length of the rod. Similarly,
other physically assembled coiled-coils have produced stiff fibres
Fig. 1 | Peptides designed for homotetrameric, antiparallel coiled-coil
bundle formation that form polymers upon covalent assembly. a, Left,
peptides 1 and 2 (Extended Data Fig. 1), shown in single-letter amino acid code,
have at their N termini (blue) either maleimide (Mal) or cysteine (C). The carboxyl
terminus (red) of each peptide is unreactive. Each sequence forms
homotetrameric bundlemers: grey, peptide 1; white, peptide 2. Centre, the
thiol-maleimide click reaction yields chains with two covalent linkages between
neighbouring bundlemers. b, TEM of rigid rods produced with a 1/1 ratio of
peptides 1 and 2. The sample is negatively stained with phosphotungstic acid
(PTA). c, CryoTEM of rigid rods longer than 1 μm in aqueous solution.
d, Negatively stained TEM of short rigid-rod chains produced using an
asymmetric ratio (10/9: [peptide 1]/[peptide 2]) of reacting bundlemers. e, The
organic tetrathiol PETMP (black wavy lines) links peptide-1 bundlemers to form
semiflexible chains. f, Examples of segmented chains produced by connecting
short rigid rods with PETMP. Rod segments within the segmented polymers
range in length from approximately 50 nm (where n, the number of bundlemers
per segment, is approximately 3 to 4) to 100 nm (where n is approximately 8 to 9).


with large cross-sections relative to the protein or peptide building
blocks. Natural fibrils that are rich in β-sheets made from misfolded
proteins (such as bovine serum albumin or β-lactoglobulin), as well
as short de novo designed peptides, exhibit large persistence lengths
(l_p) that range from 10 nm for thinner fibrils up to approximately 10 μm
for thicker fibrils. However, the stiffer, thicker β-sheet fibrils must have
a substantial mass per length in order to produce such a high-l_p 1D
nanostructure. The biopolymer with the cross-section that is closest to that
observed here is double-stranded DNA; however, DNA has an l_p of 50 nm
or less (depending on solution conditions). The TEM and cryoTEM data
in Fig. 1 reveal rods longer than 1 μm that display rigid-rod behaviour
along the entire chain length. Estimates of the rod persistence length
that are based on methods developed for rod and fibre analysis in two
dimensions provide values of more than 30 μm (Extended Data Fig. 4),
highlighting the extraordinary stiffness of such a thin molecular object.
To put this in context, in Extended Data Fig. 5 we plot the persistence
length versus the mass per unit length for our bundlemer rigid rods and
for the other nanostructures cited above.
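
To give a feel for what these persistence lengths mean, the following minimal sketch compares how straight a 1-μm chain would be for l_p = 30 μm (the lower bound quoted for the bundlemer rods) and for l_p = 50 nm (the upper bound quoted for double-stranded DNA). It assumes the standard worm-like-chain expression for the mean-square end-to-end distance of a chain equilibrated in two dimensions; this is an illustrative assumption and not necessarily the analysis used for Extended Data Fig. 4. The function names are hypothetical.

```python
import math


def mean_square_end_to_end_2d(contour_length, persistence_length):
    """<R^2> of a worm-like chain equilibrated in two dimensions.

    Assumed textbook relation (not taken from the paper):
        <R^2> = 4 l_p L [1 - (2 l_p / L)(1 - exp(-L / (2 l_p)))]
    Both arguments must use the same length unit.
    """
    x = contour_length / (2.0 * persistence_length)
    # math.expm1(-x) equals exp(-x) - 1 and stays accurate for small x.
    bracket = 1.0 + math.expm1(-x) / x
    return 4.0 * persistence_length * contour_length * bracket


def straightness(contour_length, persistence_length):
    """Root-mean-square end-to-end distance divided by contour length (1 = perfectly straight)."""
    r2 = mean_square_end_to_end_2d(contour_length, persistence_length)
    return math.sqrt(r2) / contour_length


# Lengths in micrometres; contour length of 1 um as for the rods in Fig. 1.
rod = straightness(1.0, 30.0)   # l_p > 30 um quoted for the bundlemer rods
dna = straightness(1.0, 0.05)   # l_p of about 50 nm quoted for dsDNA
print(f"bundlemer rod: {rod:.4f}   dsDNA: {dna:.4f}")
```

Under these assumptions a 1-μm bundlemer rod is essentially straight, whereas a DNA chain of the same contour length is substantially coiled.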

Hybrid physical-covalent interactions between proteins have until
now produced flexible chains, owing to the flexibility of the linear polymer
linkers or dimeric coiled-coils that link the final polymer together.
Hexameric peptide coiled-coils have produced semiflexible nanotube
chains through electrostatic interactions or native chemical ligation;
these semiflexible nanotubes exhibited an irregular chain trajectory and
were shorter than the rigid rods produced here. An important aspect of
our bundlemer chain rigidity is the use of antiparallel homotetrameric
bundlemers as the building blocks for polymerization. Formation of both
possible covalent linkages between two bundlemers is highly probable
(Fig. 1a), because once one covalent bond between the bundlemers has
formed, a second crosslink between the same pair is more likely than
a linkage to a third bundle. The thiol and maleimide reaction sites are
closely located at the ends of the rigid bundlemers, and the dual linkage
between adjacent bundlemers restricts conformational flexibility
within the final rigid-rod chain. This designed crosslinking contrasts with
linkages among parallel hexameric coiled-coils, where defects in linking
produce semiflexible chains with possible branch points. The design
of two covalent linkages between adjacent peptide bundles with D2 symmetry
minimizes chain defects and provides extreme polymer rigidity.

The efficiency of the thiol-Michael reaction allows the targeting of 
rigid-rod length through the simple stoichiometric control of reactants. 
The ultralong (micrometre-size) rigid rods in Fig. 1b, c were synthesized
with a 1/1 ratio of bundlemers displaying thiol (peptide 2) and maleimide
(peptide 1) groups. By simply altering the bundle ratio (to 10/9; see
Methods), much shorter rigid rods can be produced (Fig. 1d).
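
The link between reactant ratio and rod length can be illustrated with the classical Carothers relation for step-growth polymerization. The sketch below is an estimate under stated assumptions, not the authors' calculation: each bundlemer is treated as a bifunctional monomer (one reactive end pair per bundle end), conversion of the minority group is taken as complete, and each bundlemer is assumed to add roughly 4 nm to the contour length (the bundle size quoted in the text, ignoring the linker). The function name and defaults are hypothetical.

```python
import math


def number_average_dp(r, p=1.0):
    """Carothers estimate of the number-average degree of polymerization.

    r: stoichiometric ratio of minority to majority reactive groups (0 < r <= 1)
    p: fractional conversion of the minority groups (0 <= p <= 1)
    """
    if not (0.0 < r <= 1.0 and 0.0 <= p <= 1.0):
        raise ValueError("require 0 < r <= 1 and 0 <= p <= 1")
    denominator = 1.0 + r - 2.0 * r * p
    # Perfect stoichiometry at full conversion gives an unbounded chain length.
    return math.inf if denominator <= 0.0 else (1.0 + r) / denominator


BUNDLE_LENGTH_NM = 4.0  # assumed contour length contributed by one bundlemer

dp_short = number_average_dp(9.0 / 10.0)      # 10/9 maleimide/thiol ratio
length_short_nm = dp_short * BUNDLE_LENGTH_NM
dp_long = number_average_dp(1.0)              # 1/1 ratio

print(f"10/9 ratio: about {dp_short:.0f} bundlemers, roughly {length_short_nm:.0f} nm")
print(f"1/1 ratio: {dp_long} (limited in practice by conversion and defects)")
```

Within this idealized picture, a 10/9 ratio caps the average chain at a few tens of bundlemers, whereas a 1/1 ratio has no stoichiometric cap, in line with the short and ultralong rods described above.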

Just as we are not limited to a single bundlemer building block or 
polymer length, we also can use different organic interbundle linkers 
to alter the physical characteristics of the chain, while still producing a 
targeted bundlemer pattern along the chain. For example, bundles func- 
tionalized with maleimide (peptide 1) react with the organic tetrathiol 
pentaerythritol tetrakis(3-mercaptopropionate) (PETMP; Extended 
Data Fig. 6), to produce a semiflexible chain (Fig. 1e). The chain flexibility
must be due solely to the molecular flexibility of the PETMP, because 
the same maleimide-functionalized bundlemer was also used to make 
rigid-rod chains. SANS of the semiflexible chains in solution indicates 
a chain cross-section of one peptide bundle, as observed with the rigid
rods (Extended Data Fig. 3). We can also combine rigid-rod and semiflex- 
ible segments within a single polymer. Short rigid rods with maleimide- 
functionalized bundlemers at the rod termini can be reacted with PETMP 
to produce kinked, segmented chains (Fig. 1f), with the kinks resulting 
from the conformational flexibility of PETMP. Thus different polymer 
architectures and flexibilities are possible using the same bundlemer 
building blocks, thereby separating the characteristics of the chain 
contour from the designed amino acid sequence. 

The hybrid physical-covalent peptide rigid rods exhibit lyotropic liq- 
uid crystalline behaviour in concentrated solution, with optical textures 
typical of lamellar phases. Toric focal conic domains (TFCDs) can be 
observed in smectic liquid crystals when confined to thin films in which 
the smectic layers are generally parallel with the sample substrate.
Here, we observe apparent TFCDs in polarizing optical microscopy 
of thin films of concentrated rod solutions (Fig. 2). Peptide or protein 
fibrillar assemblies are known to form liquid-crystal phases owing to 
their inherent rod-like character. Given the extreme stiffness of our
new physical-covalent bundlemer rods, the ability to target desired 
distributions of rod length, and the ability to alter rod chemistry using 
computational design and non-natural amino acids, these systems pro- 
vide new opportunities for liquid-crystal material design. 

We can purposefully display and pattern chemical moieties rang- 
ing from small-molecule dyes to synthetic polymers to inorganic 
Fig. 2 | Liquid-crystal behaviour of concentrated rigid-rod solutions. Rods
were prepared as in Fig. 1a, d with alternating bundlemers of peptide 1 and
peptide 2. a, Polarized optical microscopy (POM) of a pseudoisotropic region of
roughly 100-nm-long rods with multiple TFCDs, indicative of a lyotropic lamellar
phase. POM was performed on an 8% (w/v) short rigid-rod solution at pH 2.
b, TEM of negatively stained short rigid rods from the pseudoisotropic region
shown in a, revealing clear rod layering. c, TEM reveals the structure of dilute
regions in which rigid rods have locally aggregated into droplets with clear rod
orientation. d, Bottom, diagram of a TFCD cross-section formed in smectic-A-type
liquid crystals. Top, enlargement of a single smectic layer, showing the
proposed homeotropic alignment of individual rigid rods. The blue dashed lines
represent boundaries between smectic layers confined between parallel walls
(thick black lines represent the glass slide and cover slip in the POM). The
liquid-crystal director n, the axis along which all rods are aligned within individual
layers, is perpendicular to the smectic layers. The local orientation director
(grey arrows) within the smectic A layers is parallel to n far from the TFCD. In the
vicinity of a topological defect on the glass substrate (yellow), the local
orientation field folds towards the defect.


nanoparticles. Super-resolution fluorescence microscopy provides the
tool for observing rigid rods synthesized with fluorophores attached to
specific lysine side chains. We first assembled rods with targeted lengths
of around 50 nm (Fig. 1), with the rod termini consisting of bundlemers
displaying thiols (Fig. 3a; see Methods for more details). We then
functionalized these short rods with either a green (peptide 3) or a red
(peptide 4) fluorescent dye (green, 4-chloro-7-nitrobenzofurazan;
red, 5(6)-carboxy-tetramethylrhodamine; Fig. 3a). We mixed separate
populations of short red rods and short green rods with unlabelled bundles with
maleimide termini (Figs. 1a, 3a) in order to link the short, dye-labelled
rods into longer rigid rods. Stochastic optical reconstruction microscopy
(STORM) allows clear observation of the individual approximately
50-nm red or green segments within the longer rods (Fig. 3a).

Bundlemers within the rigid-rod chains can denature at high temperatures,
resulting in disassembly of the chains, but the bundlemers can
reversibly reform rigid rods below the bundlemer melting temperature.
In the case of the rigid rods in Fig. 3, when the temperature was increased
to 90 °C all constituent bundles denatured, causing the rigid rods to fall
apart into covalently linked dimers of peptides, each dimer containing
one dye-labelled peptide and one peptide without dye (Fig. 3b). When
the temperature was reduced to 25 °C, the bundlemers reassembled
Fig. 3 | Reversible noncovalent assembly of rod polymers. a, Rigid rods were
created using fluorescently labelled variants (peptide 3 (right) and peptide 4
(left); Extended Data Fig. 1), each containing either 4-chloro-7-
nitrobenzofurazan (green) or 5(6)-carboxy-tetramethylrhodamine (red)
attached to the lysine-24 side chain. Bundlemers of peptide 2 (centre, white)
were used to form short rigid rods comprising a single dye type. The resulting
red and green rods were joined with peptide-1 bundlemers (grey) to make longer
rods with red and green segments. The STORM images below are of resulting
longer rigid rods. The constituent red or green fluorescence of each segment is
easily resolved. b, Rigid rods from a are heated to 90 °C, resulting in unfolding
and dissociation of the individual bundles while peptide dimers remain
covalently linked. When the solution is then cooled to 20 °C, the bundlemers and
rigid rods reform. c, Reassembled rods now display co-localization of green and
red fluorescence (producing a yellow signal when the green and red channels are
displayed concurrently) along the entire reformed rod lengths.


and physically repolymerized into rigid rods. However, the dyes that 
were originally segregated into roughly 50-nm red or green segments 
within the longer rigid rods were now mixed along the entire length of 
the reformed rigid rods (Fig. 3c). This was a direct effect of the assembly 
pathway: the segregation of dyes within the rods created by the original 
pathway was erased by temperature changes to create a similar rigid-rod 
superstructure but with the dye segregation now scrambled. 

The use of high-molecular-weight organic reagents to link bundlemers
together yields thermally responsive hydrogels. Maleimide-bearing
bundlemers coupled to a 20-kDa tetrathiol with four poly(ethylene
glycol) (PEG) arms (Extended Data Fig. 6) act as crosslinking points in a
hybrid bundlemer-PEG hydrogel network. The type of bundlemer used
Fig. 4 | Display of non-natural side chains by bundlemers affords templated,
patterned display of polymers from rigid rod bundlemers or assembly of
hybrid metal nanoparticle-bundle chains. a, AFM image of rigid rods formed
using peptides 2 and 6 (Extended Data Fig. 1), with azide-functionalized PEG 99 
chains conjugated to the rigid rods. b, AFM image of the rigid-rod area within the 
white outline in a; the area in the green rectangle was used for height analysis
along the rod longitudinal axis (d). c, Diagram illustrating bundles of peptide 6 


as the crosslinker controls the temperature response of the network. For 
example, the maleimide-terminated peptide-1 bundlemer, which has a
melting temperature, T_m, of 55 °C (ref. *°), produces a hydrogel that can
be reversibly disassembled and reformed by cycling the temperature
above and below this temperature (Extended Data Fig. 7). By contrast,
the peptide-5 bundlemer has a T_m of 80 °C (ref. !°), and produces a
network that is robust to much higher temperatures (Extended Data Fig. 7).

The structure of the bundlemer polymer can also be leveraged to pat- 
tern additional, orthogonal click reactions. Peptide 6 (Extended Data 
Fig. 1) contains a lysine at residue 13 that is functionalized to present an 
alkyne (Supplementary Information, scheme S5), which can react with 
an azide to form a triazole linkage via copper(I)-catalysed azide-alkyne
cycloaddition (CuAAC). We further functionalized peptide-6-based 
bundlemer rods with azide-terminated PEG polymers (PEG,99,). Atomic 
force microscopy (AFM) studies (Fig. 4) reveal height variation along 
the longitudinal rod axes, with a peak spacing of approximately 10 nm 
(Fig. 4d), consistent with the expected spacing of the PEG-functionalized 
bundles within the rigid rod. 

Engineered click reactions can be used to link desired moieties to 
precise locations along existing bundlemer chains, but it is also pos- 
sible to conjugate moieties to individual peptides before bundlemer 
formation, owing to the noncovalent self-assembly of the bundlemers 
themselves. We designed peptide 7 (Extended Data Fig. 1) to display a 
thiol group only at residue 24 and conjugated it with maleimide-func- 
tionalized gold nanoparticles (Fig. 4). Subsequent aqueous assembly 
produces a dominant nanostructure of gold nanoparticle chains with an 
interparticle spacing of less than 5 nm, consistent with the interspacing 
of bundlemers between adjacent nanoparticles. The hybrid nanopar- 
ticle-bundlemer chains exhibit variation in the apparent distances 
between the gold nanoparticles and changes in the chain trajectories 




(grey) and peptide 2 (white) conjugated with PEG, 999. d, Height trace along the 
longitudinal axis in b. e, Left, maleimide-functionalized gold nanoparticles are 
conjugated with peptide 7 (Extended Data Fig. 1), and then allowed to assemble 
into hybrid nanoparticle-bundlemer chains (right). f, TEM of
nanoparticle-bundlemer chains. g, Magnified TEM images of the indicated
nanoparticle-bundlemer chains in f reveal interparticle separation consistent with the
dimensions of peptide bundles. 


owing to the many possibilities for nanoparticle-bundlemer covalent 
connection. 

The computational design of coiled-coil building blocks, combined 
with the engineering of covalent interactions, makes it possible to gener- 
ate new types of physical-covalent programmable polymers, peptide 
liquid crystalline materials, and even one-, two- and three-dimensional 
nanostructures. The ‘bundlemer’ concept also allows the functionality
of these structures to be altered with natural and non-natural amino 
acids. Overall, bundlemers provide a simple, versatile toolbox for a wide 
range of materials design and refinement, all while harnessing the design 
possibilities and function afforded by biologically inspired peptides. 
Methods


Schemes for synthesis and details of all amino acids, peptides and 
functionalized gold nanoparticles are described in detail in the Sup- 
plementary Information. 


Preparing bundlemer polymers by the thiol-maleimide reaction 
Rigid rods. We prepared two separate solutions, one of peptide 1 and 
one of peptide 2 (Extended Data Fig. 1), each containing 1 mM peptide 
in phosphate buffer (pH 6, 25 mM), for the respective formation of each 
homomeric bundle. We then mixed equal volumes of the two bundle 
solutions and added 0.2 eq. of tris(2-carboxyethyl)phosphine (TCEP;
50 mM in distilled water). The bundle mixture was shaken overnight at 
room temperature to produce the rods. 


Alkyne-containing rods. To make rods of peptide 6 and peptide 2 (Extended
Data Fig. 1) for PEG conjugation, we added 1 μmol of each peptide
as freshly lyophilized powder to a 2-ml Eppendorf tube. To the solid
mixture, we added 200 μl of Millipore H2O to make a 5-mM solution of
bundles, and vortexed the solution. The pH of the system was checked
(the pH was roughly 6). The solution was mixed at room temperature
for 12 h and at 60 °C for an additional 12 h.


Representative maleimide-excess short rigid rods. We prepared
separate solutions of peptide 1 and peptide 2 in phosphate buffer (pH 6,
25 mM) at 1% w/v. We mixed different volumes of the two bundle solutions
to achieve a molar ratio of maleimide/thiol of 10/9 (for example,
we could mix 0.528 ml of peptide-1 solution with 0.472 ml of peptide-2
solution to make 1 ml of 1% w/v solution). We then added TCEP (50 mM in
distilled water) at 0.2 eq. relative to peptide 2. The mixture was shaken
overnight at room temperature to produce the short rigid rods.
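
The mixing volumes for a target molar ratio can be worked out as in the sketch below. It assumes both stock solutions have the same w/v concentration, so that the moles delivered scale with volume divided by molecular weight. The molecular weights used in the example are hypothetical placeholders chosen only to show the arithmetic (the actual values are not given in this section); the function name is also an assumption.

```python
def split_volumes(total_volume_ml, molar_ratio, mw_a, mw_b):
    """Volumes of stock A and stock B giving moles_A / moles_B = molar_ratio.

    Assumes both stocks have the same w/v concentration, so
    moles is proportional to volume / molecular weight.
    """
    volume_ratio = molar_ratio * mw_a / mw_b          # V_A / V_B
    v_a = total_volume_ml * volume_ratio / (1.0 + volume_ratio)
    return v_a, total_volume_ml - v_a


# Molecular-weight ratio implied by the quoted 0.528 ml / 0.472 ml example.
implied_mw_ratio = (0.528 / 0.472) / (10.0 / 9.0)

# Hypothetical molecular weights (g/mol) with roughly that ratio.
v_pep1, v_pep2 = split_volumes(1.0, 10.0 / 9.0, mw_a=3020.3, mw_b=3000.0)

print(f"implied MW(peptide 1)/MW(peptide 2): {implied_mw_ratio:.4f}")
print(f"peptide-1 stock: {v_pep1:.3f} ml, peptide-2 stock: {v_pep2:.3f} ml")
```

With identical molecular weights the split would be 10/19 and 9/19 of the total volume; the slightly larger peptide-1 volume quoted above is what one would expect if peptide 1 were marginally heavier than peptide 2.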


Semi-flexible chains. We prepared separate solutions of 1 mM peptide
1 and 0.25 mM PETMP (Extended Data Fig. 6; Sigma) in phosphate buffer
(pH 6, 25 mM). We mixed the same volumes of the two solutions, and 
added 0.2 eq. of TCEP (50 mM in distilled water) relative to peptide 1. 
The mixture was shaken overnight at room temperature to produce the 
semi-flexible chains. 


Kinked chains. We prepared 1 ml of the 1% w/v short rigid rod solution 
as described above (with an excess of maleimide-containing bundles in 
order to guarantee that the short rods had maleimide termini; the molar 
ratio of maleimide/thiol was 10/9). PETMP (2.9 μl, 50 mM) was added to
produce a maleimide/thiol ratio of 1. The mixture was shaken at room 
temperature for a week to produce kinked chains. 


Conjugation of azide-functionalized PEG to alkyne- 
functionalized rods 

Stock solutions in Millipore H2O were prepared of CuSO4 (200 mM),
tris-hydroxypropyltriazolylmethylamine (THPTA; 400 mM) and sodium
ascorbate (1 M). To a 1.5-ml Eppendorf tube, we added 40 μl of 5-mM
peptide, which had previously been assembled into rods. We then dissolved
N3-PEG, 999 (2 mg; 5 eq.) in 160 μl of Millipore water and added
this to the peptide-rod solution. From the respective stock solutions
we added 1 μl of CuSO4 (2 eq.), 1 μl of THPTA (4 eq.) and 2 μl of sodium
ascorbate (10 eq.). The solution was purged with nitrogen and then
vortexed and mixed at 40 °C for 48 h.


Formation of gold-nanoparticle chains with peptide-7 bundles 

We dissolved 1 mg of gold nanoparticles functionalized with maleimide
groups (Supplementary Information scheme 7) in 1 ml dimethylformamide
(DMF) to make a transparent yellow solution. We added 5 μl of DMF
solution consisting of 1 mM thiol-functionalized peptide 7 (Extended
Data Fig. 1) and 0.2 eq. TCEP to the gold-nanoparticle DMF solution. The
mixture was reacted overnight at room temperature to produce gold
nanoparticles conjugated with peptide 7. The solution was then dialysed
against pure water to allow coiled-coil bundle formation by the conjugated
peptide 7, producing chains of nanoparticle-peptide-7 bundles.


Representative hydrogel of peptide 1 with four-arm PEG 
tetrathiol 

We dissolved 8 mg of peptide 1 and 10.86 mg of four-arm PEG tetrathiol 
(20 kDa; JenKem Technology) in 620 μl phosphate buffer (pH 6, 25 mM).
We added 0.2 eq. TCEP (9 μl, 50 mM in distilled water) relative to peptide.
The mixture was shaken at room temperature. The hydrogel usually 
formed after two to four hours of reaction. 
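
As a rough consistency check on this recipe, the sketch below works through the reagent amounts. It assumes, purely for illustration, that the maleimide and thiol end groups are present at a 1/1 molar ratio and that each peptide carries one maleimide; the peptide molecular weight is not given here, so the code back-calculates the value implied by that assumption rather than asserting it. The function name and dictionary keys are hypothetical.

```python
def hydrogel_recipe_check(peg_mg=10.86, peg_mw=20000.0, arms=4,
                          peptide_mg=8.0, tcep_eq=0.2, tcep_mM=50.0):
    """Reagent amounts for the peptide-1 / four-arm PEG tetrathiol hydrogel.

    Assumes a 1/1 maleimide/thiol ratio (one maleimide per peptide).
    """
    # mg divided by g/mol gives mmol; multiply by 1000 for micromoles.
    thiol_umol = arms * peg_mg / peg_mw * 1000.0
    peptide_umol = thiol_umol                      # 1/1 end-group assumption
    implied_peptide_mw = peptide_mg / peptide_umol * 1000.0
    tcep_umol = tcep_eq * peptide_umol
    # micromoles divided by mM gives ml; multiply by 1000 for microlitres.
    tcep_ul = tcep_umol / tcep_mM * 1000.0
    return {
        "thiol_umol": thiol_umol,
        "implied_peptide_mw": implied_peptide_mw,
        "tcep_ul": tcep_ul,
    }


recipe = hydrogel_recipe_check()
print(f"thiol groups: {recipe['thiol_umol']:.3f} umol")
print(f"implied peptide MW: {recipe['implied_peptide_mw']:.0f} g/mol")
print(f"TCEP stock needed: {recipe['tcep_ul']:.1f} ul")
```

Under the 1/1 assumption the TCEP volume comes out close to the 9 μl quoted above, and the implied peptide mass is of the same order as the roughly 3 kDa stated for the peptides in the main text.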


Transmission electron microscopy 

Carbon-coated 200-mesh copper grids (CF200-Cu; Electron Microscopy
Sciences) were freshly treated by glow discharge using a plasma cleaner
(PDC-32G; Harrick Plasma) at low level for 20 s. We then dropped 4 μl of
sample solution on the grid. After 1 min, the remaining liquid was blotted
from the edge of the grid using filter paper. The grid was allowed to air dry
for 10 min before TEM on a FEI TALOS F200C microscope. For negative
staining, we applied 6 μl of an aqueous solution of phosphotungstic
acid (2% w/v; pH 6) to the dried grid and incubated it for 15–30 s. The
grid was then blotted with filter paper. The stained grid was allowed to
sit for 10 min before TEM.

CryoTEM imaging was also performed on a FEI TALOS F200C microscope
with accelerating voltage at 200 kV. We used lacey grids (Ted Pella)
for all grid preparation after oxygen plasma treatment. We prepared
vitrified grids for cryoTEM using the Vitrobot, an automated plunge-freezing
device that vitrifies a thin solution layer to the temperature
of liquid nitrogen. A sample droplet of 1.5 μl was deposited onto the
plasma-cleaned lacey grids. Depending on sample viscosity and concentration,
we adjusted blotting parameters in order to obtain the optimal
liquid-film thickness, usually requiring two to three blottings lasting
1–2 s at 100% humidity. After blotting, we allowed the sample grids to
relax for 2 s. In order to achieve an extremely fast rate of cooling of the
homogeneous vitreous layer, we plunged the grid into liquid ethane (at
roughly −175 °C) and then transferred it to liquid nitrogen for storage.
During imaging, the cryoTEM holder was maintained at −177 °C to prevent
ice crystallization or sublimation. The images were recorded with
a FEI Ceta 16M (charge-coupled-device) or Falcon-II camera (CMOS) at
a low electron dose.


Small-angle neutron scattering 

SANS experiments were performed on the NG-B 30-m SANS instrument,
part of the Center for High Resolution Neutron Scattering (CHRNS) at
the NIST Center for Neutron Research (NCNR), National Institute
of Standards and Technology, Gaithersburg, MD, USA. Installed on the
60 mm × 60 mm split neutron guide NG-B, this instrument delivers a
neutron beam of wavelength (λ) approximately 6 Å, with a resolution
(Δλ/λ) of 10% at full width at half maximum (FWHM). The detector
installed on this instrument is a 640 mm × 640 mm ³He position-sensitive
counter with a resolution of 5.08 mm × 5.08 mm. We used
sample-to-detector lengths of 1 m, 4 m and 13 m to cover a q-range of 0.0035 Å⁻¹
to 0.4 Å⁻¹ for scattering experiments involving the 1% w/v semi-flexible
chains. (Here, q is the scattering vector given by q = 4π sin(θ/2)/λ, where
θ is the scattering angle and λ is the neutron wavelength.) A wider q-range
of 0.0015 Å⁻¹ to 0.35 Å⁻¹ was covered for scattering experiments on
1% w/v rigid rods, enabled by the additional use of neutron lenses at the
13-m detector configuration. We corrected the raw data obtained from
scattering experiments for background noise and radiation, detector
sensitivity, and open beam transmission using IgorPro software to
obtain a normalized scattering intensity curve. We calculated standard
deviations statistically using the number of averaged detector counts
at each data point. The reduced 1D scattering intensity obtained after
buffer subtraction was fitted to various models using SasView
software.
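
The relation between scattering angle and scattering vector quoted above can be sketched as follows. The example converts the lowest q reached without lenses (0.0035 Å⁻¹) back to a scattering angle at a wavelength of 6 Å, and then to a radial position on the detector at the 13-m setting. This is pure geometry for illustration; it ignores beam-stop size, beam divergence and detector offsets, and the function names are hypothetical.

```python
import math


def scattering_vector(theta_rad, wavelength_angstrom):
    """q = 4*pi*sin(theta/2)/lambda, in inverse angstroms (theta is the full scattering angle)."""
    return 4.0 * math.pi * math.sin(theta_rad / 2.0) / wavelength_angstrom


def scattering_angle(q_inv_angstrom, wavelength_angstrom):
    """Inverse of scattering_vector: full scattering angle in radians."""
    return 2.0 * math.asin(q_inv_angstrom * wavelength_angstrom / (4.0 * math.pi))


WAVELENGTH = 6.0           # angstroms, as stated in the text
DETECTOR_DISTANCE_MM = 13000.0

theta_min = scattering_angle(0.0035, WAVELENGTH)
radius_mm = DETECTOR_DISTANCE_MM * math.tan(theta_min)

print(f"theta at q = 0.0035 1/A: {math.degrees(theta_min):.3f} degrees")
print(f"radial position on detector at 13 m: {radius_mm:.1f} mm")
```

The small angle involved (a fraction of a degree) shows why long sample-to-detector distances are needed to reach the low-q region that probes length scales of hundreds of nanometres.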
Scattering from an isotropic solution of non-interacting, monodisperse
species is described by the general equation:

$$I(q) = n V^{2} (\Delta\rho)^{2} P(q)$$

Here, I(q) is the normalized scattered intensity as a function of the
scattering vector q; n is the number density of scattering species; V is
the volume of each scatterer; Δρ is the difference in scattering length
density (SLD) between the scattering species and solvent; and P(q) is
the form factor, given by the average geometric shape of the scattering
species in solution.

To fit the scattering curve from the rigid-rod solution in SasView, we
chose a cylinder model, the P(q) of which is calculated by:

$$P(q) = \frac{\text{scale}}{V} \int_{0}^{\pi/2} f^{2}(q,\alpha)\,\sin\alpha\,\mathrm{d}\alpha + \text{background}$$

where

$$f(q,\alpha) = 2(\Delta\rho)V \times \frac{\sin\left(\tfrac{1}{2} q L \cos\alpha\right)}{\tfrac{1}{2} q L \cos\alpha} \times \frac{J_{1}(q r \sin\alpha)}{q r \sin\alpha}$$

Here, J_1 is the first-order Bessel function; α is the angle between the
cylinder axis and the scattering vector q; L is the length of the cylinder;
r is its radius; ‘scale’ is the volume fraction of cylinders; and
‘background’ is the background scattering intensity. An integral over α
from 0 to π/2 radians averages the scattering intensity over all possible
orientations of rods in an isotropic solution.
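
The orientational average in the cylinder model can be evaluated numerically as in the sketch below. It is a minimal standard-library illustration, not SasView's implementation: it computes only the normalized shape factor (the integral with the prefactor 2(Δρ)V, ‘scale’ and ‘background’ left out, so that the result tends to 1 as q tends to 0), uses a midpoint rule over α, and evaluates 2J_1(x)/x from its power series, which is adequate for the moderate arguments used here. The cylinder dimensions in the example (radius 10 Å, matching the roughly 2-nm diameter quoted in the text, and a length of 500 Å) are illustrative choices.

```python
import math


def sinc(x):
    """sin(x)/x with the x -> 0 limit handled explicitly."""
    return 1.0 if abs(x) < 1e-12 else math.sin(x) / x


def two_j1_over_x(x, terms=40):
    """2*J1(x)/x from the power series of the first-order Bessel function.

    Suitable for moderate |x| (roughly below 20); tends to 1 as x -> 0.
    """
    half_sq = (x / 2.0) ** 2
    total, term = 0.0, 1.0
    for k in range(terms):
        total += term
        term *= -half_sq / ((k + 1) * (k + 2))
    return total


def cylinder_shape_factor(q, length, radius, n_alpha=2000):
    """Orientation-averaged, normalized cylinder form factor (equals 1 at q = 0).

    q in inverse angstroms; length and radius in angstroms.
    """
    step = (math.pi / 2.0) / n_alpha
    total = 0.0
    for i in range(n_alpha):
        alpha = (i + 0.5) * step                       # midpoint rule
        amplitude = (sinc(0.5 * q * length * math.cos(alpha))
                     * two_j1_over_x(q * radius * math.sin(alpha)))
        total += amplitude * amplitude * math.sin(alpha) * step
    return total


for q in (0.001, 0.01, 0.1):
    print(f"q = {q:5.3f} 1/A   P(q) = {cylinder_shape_factor(q, 500.0, 10.0):.5f}")
```

At low q the result follows the Guinier form 1 − q²R_g²/3 with R_g² = r²/2 + L²/12, which offers a quick sanity check on the numerical average.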

For semi-flexible rods, a flexible cylinder model fit was performed in
SasView. Its form factor P(q) is defined by the equation:

$$P(q) = \frac{\text{scale}}{V}\,\langle f^{2}(q,\alpha)\rangle + \text{bkg}$$

where ⟨f²(q,α)⟩ is the squared scattering amplitude averaged over all possible
orientations α for a given scattering vector q. A worm-like semi-flexible
cylindrical chain is used with a contour length L, radius r and Kuhn length, K_l,
of 2l_p, where l_p is the persistence length of the worm-like chain. This
model also incorporates excluded-volume interactions between segments
of the worm-like chain in solution.


Optical microscopy 

We prepared a 15% w/v solution of short rigid rods with an average length
of roughly 200 nm by gently concentrating a dilute solution of rods as in
ref. °°. Nitrogen gas was blown at a low flow rate into a flask containing a
0.5% w/v solution of short rigid rods under constant mild agitation. This
slowly concentrated the solution by evaporating water molecules from
the exposed air–water interface, avoiding the formation of inhomogeneities
and nonequilibrium structures such as crystalline aggregates
during the concentration process. The concentrated solution was then
adjusted to pH 2 by adding a few drops of 1 M hydrochloric acid. We
added anhydrous sodium chloride to yield a rod solution containing the
salt at roughly 100 mM. Samples for polarized optical microscopy were
prepared by adding 2 μl of the solution between a clean glass slide and
a cover slip. We immediately investigated the freshly prepared sample
slides for birefringence resulting from the formation of liquid-crystalline
phases under polarized light in transmission mode on an Olympus BX60
light microscope at 20 °C. High-resolution images were captured by a
Nikon DS-Fi1 digital camera and the images were analysed using
NIS-Elements imaging software.


Rheological measurements 

Rheological measurements were performed on a TA Instruments DHR-3
rheometer. A hydrogel (160 μl) was deposited onto the
rheometer stage. A 20-mm stainless-steel parallel plate was used and
the gap height was set to 500 μm for measurement. Oil was applied to
seal the sample. We monitored the storage modulus G′ and loss modulus
G″ under an applied strain of 0.01% to 10,000% at a frequency of
1 rad s⁻¹ for the strain sweep, and a frequency of 0.1 rad s⁻¹ to 200 rad s⁻¹
at strain 0.1% for the frequency sweep. Temperature sweeps were performed
from 25 °C to 80 °C at 5 °C increments. Temperature-reversible
experiments were carried out by subjecting the gel to a strain of 0.5%
and frequency of 1 rad s⁻¹.


Stochastic optical reconstruction microscopy 

Preparation of solutions of rigid rods with red or green individual 
rod segments. We prepared two 0.5-ml solutions of short rigid rods 
with thiol termini through click reactions between identical volumes 
of peptides 2 (1 mM) and 3 (0.9 mM), or between identical volumes of
peptides 2 (1 mM) and 4 (0.9 mM) (Extended Data Fig. 1). We then mixed
each of the two rigid-rod solutions with an identical volume containing
the appropriate amount of peptide 1 (100 µl, 1 mM), to produce a ratio
of total added maleimide groups to thiol end groups of 1/1 in the entire
solution. The mixture was shaken at room temperature for a week to
produce longer chains containing short rod segments of red or green dye.


Denaturation and reassembly of rigid rods with mixed red and green 
segments. We heated a solution of rigid rods with red- and green-dye- 
containing segments to 90 °C for 10 min in order to denature the rods. 
We then incubated the solution at 4 °C for 24 h in order to reassemble
the constituent bundles, resulting in rigid rods with green and red dye 
mixed along the entire length of the rods. 


STORM imaging. STORM images were taken on a Zeiss Elyra PS.1 super- 
resolution microscope. Rods were mounted on a high-precision 
22 mm × 22 mm coverslip (Zeiss) by applying 10 µl of a rod solution for
10 s. The remaining liquid was removed using filter paper. The sample
adhered to the cover slip was rinsed with phosphate buffer (pH 6, 25 mM)
five times. We added an oxygen-scavenging buffer (540 mM glucose,
3.1 µM catalase, 7.6 µM glucose oxidase, 10 mM NaCl, 20 mM cysteamine)
in 58 mM Tris-HCl to the sample just before image acquisition, and sealed
it in a magnetic conflat flange chamber (Chamlide). STORM images with
4-chloro-7-nitrobenzofurazan or 5(6)-carboxy-tetramethylrhodamine
were taken with a Plan-Apochromat 100×/1.46 oil objective with 488-nm
or 561-nm laser excitation, respectively. For each STORM image, we
acquired 500 frames, aligned them using a model-based algorithm, and
filtered them with a precision of 1–30 nm. For STORM imaging of rigid
rods with individual rod segments containing either red or green dye, 
images were taken sequentially, merged, and then aligned. All image- 
processing steps were completed with Zen 2012 software. 


Atomic force microscopy 

We carried out AFM with a Bruker multimode system, using Bruker 
ScanAsyst Air ultrasharp tips with a nominal tip radius of 2 nm and a
spring constant of 0.4 N m⁻¹. We used freshly cleaved mica discs (Ted
Pella) as substrates for sample deposition. Samples were prepared by
spin-coating 20 µl of a roughly 5 mM sample of PEG,o99-conjugated
bundlemer rods onto a freshly cleaved mica disc using a spin-coater
(WS-650SZ, Laurell Technologies). In a first stage the spin-coater was
set to 500 r.p.m. for 10 s, during which the sample solution was pipet- 
ted onto the substrate. The speed was then increased sequentially to 
2,000 r.p.m. and held for 3 min at the second stage to spin off the liquid. 
During the 3 min of spin coating, 20 µl of Milli-Q water was applied to
wash off excess salts. The mica disc appeared to be dry after the
spin-coating and was rested for at least 1 h before being subjected to AFM
imaging. The instrument was operated in peak force tapping mode. A
new AFM tip was used for each different sample. The peak force set point
was adjusted manually in order to optimize the spatial resolution as well
as to minimize sample damage. Scanning was performed in the horizontal
direction and repeated in the 45° diagonal direction to exclude
scanning artefacts. Micrographs were recorded digitally using Bruker
Nanoscope software with 512 lines at a 0.5-Hz scan rate and corrected 
for background undulations using the in-software algorithm function. 


Data availability 


The data supporting the findings of this study are available within the 
paper and its Supplementary Information files. 

Peptide   Sequence (N terminus to C terminus)

1   Mal-DEKIKNM ADQIKHM AWMIDRM AEKIDRE A
2   C-DEEIRRM AEEIROM AERIQOM AEQIQOE A
3   Mal-DEKIKNM ADQIKHM AWMIDRM AEKIDRE A
4   Mal-DEKIKNM ADQIKHM AWMIDRM AEKIDRE A
5   Mal-REEIRRM AEEIRRM AREIERM AEBIERR A
6   Mal-DEKIKNM ADQIKKM AWMIDRM AEKIDRE A
7   DEKIKNM ADQIKHM AWMIDRM AEKIDRE A

Extended Data Fig. 1 | Structures of peptides 1-7. The single-letter amino acid
sequences of peptides 1-7 are shown at the top, noting the peptides in which 
maleimide (Mal) has been chemically added to the N terminus, and those in 
which lysine amino acids (K) have been modified with the structures shown at 
the bottom. The unmodified amino acid sequences of peptide 1, peptide 2 
(without an N-terminal cysteine) and peptide 6 have previously been denoted 
P622_6 (ref.°), BNDL1 (ref. !°) and BNDL2 (ref. *”), respectively. 

Extended Data Fig. 2 | The thiol–maleimide click reaction. R₁ and R₂ represent
the possible remainders of the molecular structures. The thiol–maleimide
reaction is catalysed by either a base or a nucleophile (Nu) catalyst.

Extended Data Fig. 3 | SANS of peptide rods with different linker
chemistries. a, Scattering from rigid rods (blue squares) and the corresponding
rigid cylinder fit (black curve). Rigid rods (identical to those in Fig. 1b) were
assembled in 25 mM pH 6 phosphate buffer prepared in deuterated water at
20 °C. b, Scattering from semi-flexible fibres (purple triangles) and the
corresponding flexible cylinder fit (black curve). Semi-flexible chains (identical
to those shown in Fig. 1e) were dissolved in 25 mM pH 6 phosphate buffer
prepared in deuterated water at 20 °C. In each case, R is the fitted cylinder radius
of the corresponding model.

Extended Data Fig. 4 | Estimation of the persistence length of rigid-rod
bundle chains. The estimation was carried out using FibreApp tracking and the
analysis software of ref.”". a, CryoTEM of rigid-rod bundle chains. b, The same
rigid-rod bundle chains after software tracking. The red trace is the last rod
tracked with the software in the image and is used, along with the earlier blue
traces, in the stiffness analyses. c, Plot of calculated mean-squared end-to-end
distance (MSED) between contour segments along the tracked rod (blue
squares), and the corresponding MSED fit (black curve) between contour
segments. This fit is based on a worm-like chain model in two dimensions with
the following theoretical dependence:
$\langle R^2 \rangle = 4\lambda\,[\,l - 2\lambda(1 - e^{-l/2\lambda})\,]$, where $\lambda$ is the
persistence length and $R$ is the direct distance between any pair of segments
along a contour separated by arc length $l$. The persistence length is estimated to
be 33.4 ± 0.4 µm. d, An alternative method for estimating the persistence length
of very stiff, one-dimensional objects is the mean-squared midpoint
displacement (MSMD). This plot shows the calculated MSMD between contour
segments along the tracked rod (blue squares) and the corresponding MSMD fit
(black curve). This fit is based on an equation that describes the behaviour of a
midpoint deviation along a rod: $\langle u_x^2 \rangle = l^3/(48\lambda)$, where
$\langle u_x^2 \rangle$ is the MSMD between
any pair of segments along a tracked-rod contour, separated by an arc length $l$,
assuming that the displacements are small in comparison with the
corresponding arc lengths ($|u_x|$ is much less than $l$). This method estimates the
persistence length of the rod to be 41.3 ± 0.5 µm.
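
The two estimators described in this caption can be evaluated numerically. The following is a minimal sketch, assuming the standard two-dimensional worm-like-chain form of the MSED and the small-deflection form of the MSMD; the function names are illustrative and are not part of the FibreApp software.

```python
import math


def msed_wlc_2d(arc_length, persistence_length):
    """Mean-squared end-to-end distance of a 2D worm-like chain.

    Implements <R^2> = 4*lam*[l - 2*lam*(1 - exp(-l/(2*lam)))].
    Both arguments must use the same length unit (for example micrometres).
    """
    l, lam = arc_length, persistence_length
    # -expm1(-x) equals 1 - exp(-x) and stays accurate when x is very small,
    # which is the regime of a stiff rod (l much shorter than lam).
    one_minus_exp = -math.expm1(-l / (2.0 * lam))
    return 4.0 * lam * (l - 2.0 * lam * one_minus_exp)


def msmd_stiff_rod(arc_length, persistence_length):
    """Mean-squared midpoint displacement, <u_x^2> = l^3 / (48*lam)."""
    return arc_length ** 3 / (48.0 * persistence_length)


def persistence_from_msmd(arc_length, msmd):
    """Invert the MSMD relation to recover the persistence length."""
    return arc_length ** 3 / (48.0 * msmd)
```

For a contour segment much shorter than the persistence length, `msed_wlc_2d` approaches the straight-rod limit of `arc_length ** 2`, which is why the MSED fit becomes insensitive for very stiff objects and the MSMD is offered as an alternative.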

Extended Data Fig. 5 | Plot showing persistence length versus mass per unit
length for various 1D polymers and molecular assemblies. The plot is
adapted with permission from ref. '°, Springer Nature Limited. For
peptide-bundlemer rigid rods, the persistence length was estimated from our cryoTEM
data using the methods of ref.”". Other persistence lengths were taken from the
following references: hydrocarbon polymers, ref. '°; fd and M13 viruses, refs. "2;
DNA, refs. '*?*; actin, refs. *3°; tobacco mosaic virus, refs. °?°*°; microtubulin,
refs. °°*°; thin (diameter 2–3 nm) and thick (diameter 4–6 nm) twisted bovine
serum albumin (BSA) fibres, ref. 7°; β-lactoglobulin β-sheet-rich twisted fibrils
(with diameters ranging from 1 nm to 6 nm, with a mean of roughly 3 nm),
ref.'!?2; amino acid β-sheet ribbons with low stacking (producing ribbon
diameters of roughly 4 nm) and high stacking (producing ribbons with
diameters of 4–8 nm), ref. °.

Extended Data Fig. 6 | Chemical structures of bundlemer organic linkers.
a, Structure of PETMP for the formation of semiflexible or kinked chains.
b, Chemical structure of four-arm PEG tetrathiol (20 kDa) for hydrogel
formation.

Extended Data Fig. 7 | Hydrogel network rheology. Hydrogels were composed
of peptides 1 or 5 (Extended Data Fig. 1) linked with four-arm PEG tetrathiol
(Extended Data Fig. 6). We monitored the storage modulus G′ (filled symbols)
and loss modulus G″ (open symbols) as functions of temperature (left) and as a
result of the cycling of temperature (right); in the latter case, temperature ramps
were performed over 5 min between isothermal measurements of 2-min
duration. a, Peptide 1, with T_m = 55 °C. Peptide 1 produces a temperature-reversible
hydrogel owing to the low melting temperature of the designed
peptide. b, Peptide 5, with T_m = 85 °C. The hydrogel produced with peptide 5 is
stable to approximately 85 °C and shows much more rigid gel properties at all
temperatures tested.

Over the past eight hundred thousand years, glacial–interglacial cycles oscillated with a
period of one hundred thousand years (‘100k world”). Ice core and ocean sediment data 
have shown that atmospheric carbon dioxide, Antarctic temperature, deep ocean 
temperature, and global ice volume correlated strongly with each other in the 100k 
world” °. Between about 2.8 and 1.2 million years ago, glacial cycles were smaller in 
magnitude and shorter in duration (‘40k world”). Proxy data from deep-sea sediments 
suggest that the variability of atmospheric carbon dioxide in the 40k world was also 
lower than in the 100k world®°, but we do not have direct observations of atmospheric 
greenhouse gases from this period. Here we report the recovery of stratigraphically 
discontinuous ice more than two million years old from the Allan Hills Blue Ice Area, East 
Antarctica. Concentrations of carbon dioxide and methane in ice core samples older
than two million years have been altered by respiration, but some younger samples are 
pristine. The recovered ice cores extend direct observations of atmospheric carbon 
dioxide, methane, and Antarctic temperature (based on the deuterium/hydrogen 
isotope ratio δD_ice, a proxy for regional temperature) into the 40k world. All climate
properties before eight hundred thousand years ago fall within the envelope of 
observations from continuous deep Antarctic ice cores that characterize the 100k 
world. However, the lowest measured carbon dioxide and methane concentrations and 
Antarctic temperature in the 40k world are well above glacial values from the past eight 
hundred thousand years. Our results confirm that the amplitudes of glacial—interglacial 
variations in atmospheric greenhouse gases and Antarctic climate were reduced in the 
40k world, and that the transition from the 40k to the 100k world was accompanied by a
decline in minimum carbon dioxide concentrations during glacial maxima.


Earth has been cooling, and ice sheets expanding, over approximately 
the past 52 million years (Myr)". Superimposed on this cooling are peri- 
odic changes in the Earth’s climate system driven by variations in the 
eccentricity (with periods of 400 and 100 kyr) and precession (23 and 
19 kyr) of the Earth’s orbit around the Sun, and the tilt of the spin axis 
(about 41 kyr). From around 2.8 to 1.2 Myr ago (Ma), the Earth’s climate 
system oscillated between glacial and interglacial states with a period 
of about 40 kyr (the ‘40k world’). Between 1.2 and 0.8 Ma, an interval
known as the ‘mid-Pleistocene transition’ (MPT), the period of glacial 
cycles lengthened to about 100 kyr and glacial periods became colder”, 
for reasons that are poorly understood. The post-MPT glacial cycles are 
characterized by a quasi-100-kyr period (the ‘100k world’).

Studies of stratigraphically continuous ice cores have shown that
atmospheric CO₂ is directly linked to Antarctic and global temperature
over the last 800 kyr**. The coupling of the Earth’s climate and carbon
cycle in earlier times, however, has not been conclusively demonstrated.
Recent boron-based reconstructions have suggested that glacial CO₂
declined across the MPT, but interglacial CO₂ remained stable*°. These
studies suggest that the range of CO₂ was reduced in the 40k world, as
was the amplitude of glacial cycles. On the other hand, CO₂ estimates
based on foraminiferal δ¹³C predict that CO₂ varied between 300 and 180
parts per million (ppm) in the 40k world®”, hinting at a fundamentally
different relationship between CO₂ and temperature. As both of these
techniques rely on indirect measurements of atmospheric CO₂, they
have lower accuracy and precision (typically more than ±20 ppm”;
2 s.d. (σ)) than can be achieved by direct measurements in ice core samples
(±2 ppm; 2σ). Accurate and precise CO₂ measurements from ice cores
that pre-date the MPT are therefore needed.


Old ice records from the Allan Hills 


Million-year-old ice was recently discovered” in shallow ice cores drilled
in the Allan Hills Blue Ice Area (BIA), Antarctica (—76.73° N, 159.36° E; 
Extended Data Figs. 1, 2). In the Allan Hills, bedrock topography and 

Fig. 1 | Age–depth profile of Allan Hills ice cores. a, ALHIC1502 data in blue;
b, ALHIC1503 data in red. c, ⁴⁰Ar_atm ages plotted against distance above bedrock
with the same colour coding as in a, b. Data in circles were measured at Princeton
University’. New measurements made at Scripps Institution of Oceanography are
plotted as squares. Error bars represent the external reproducibility (1σ; 110 kyr
and 220 kyr for samples measured at Scripps and Princeton, respectively) of the
measurement or 10% of the sample age, whichever is greater. Shading highlights
sections in which ice that is more than 1 Myr old is present. Sections affected by
respiration are marked by the vertical arrows (blue, ALHIC1502; red, ALHIC1503).


strong katabatic winds lead to the exhumation of old ice from depth to 
the surface’®. In 2015-2016, two additional cores were drilled. The first 
extended the drilling to a depth of 147 m at the BIT-58 borehole (named 
ALHIC1503 hereafter), where approximately 1-Myr-old ice had previously 
been discovered. The second (ALHIC1502), drilled 56 m to the southwest,
reached bedrock at 191 m. These ice cores are dated by measuring the ⁴⁰Ar
deficit in trapped air relative to the modern atmosphere” (⁴⁰Ar_atm), and
represent an archive of the Earth’s climate extending well into the 40k 
world. A nearby blue-ice core (Allan Hills Site 27; S27) gives a continuous 
record from about 100 to 250 ka, and serves as a reference section for 
comparison with ALHIC1502 and ALHIC1503*°. 

Age-depth profiles of the ALHIC1502 and ALHIC1503 cores (Fig. 1) 
show that both sites contain 300-500-kyr-old ice, overlying a basal unit 
confined to approximately the bottom 30 m, with ages that range from 
more than 1 to 2.7 Myr. At ALHIC1503, the deepest sample lies within 
about 1 m of bedrock and has the oldest ⁴⁰Ar_atm age, 2.7 ± 0.3 Myr (1σ).
However, stratigraphic disturbance in both ALHIC1502 and ALHIC1503
is evident from abrupt age discontinuities (Fig. 1), accompanied by large
swings in δ¹⁸O of O₂ (δ¹⁸O_atm) and δD_ice values”. In light of these complications,
we interpret the paleoclimate records from the Allan Hills BIA
as discrete ‘snapshots’—discontinuous but accurate portraits—of the 
climate state, rather than continuous time series. 


Climate properties in the old ice


Paleoclimate reconstructions from stratigraphically discontinuous ice 
sections face several challenges. First, there is the potential for sampling
bias in the preserved climate states due to possible differences in accumulation
rate and/or ice rheology. Second, diffusion in million-year-old
ice may lead to smoothing of paleoclimate properties or proxies’””°,
though this effect is likely to be minor in Allan Hills ice (see Methods).
Third, discrete sampling might not capture the complete glacial–interglacial
range of properties of interest. Overall, we estimate that
76 ± 14% (2σ) of the total atmospheric CO₂ variability in the 40k world is
recovered by our CO₂ samples. This estimate is based upon our deductions
that 90% of the true glacial–interglacial climate variability is

Fig. 2 | Climate properties over the past 2.9 Myr documented in ice core and
benthic foram records. a, The continuous 800-kyr δD_ice record from the
European Project for Ice Coring in Antarctica, Dome C (EDC) ice core* (black
line); the continuous S27 δD_ice record covering 120–250 ka’® (red line); and the
discrete Allan Hills δD_ice ice samples (red circles). b, The 800-kyr ice core CH₄
record” (orange line) and the binned Allan Hills CH₄ data (orange circle). c, The
800-kyr ice core CO₂ record*>*”5 (purple line) and the binned Allan Hills CO₂ data
(purple circles). Note that there are no reliable CO₂ and CH₄ analyses in the
2.7-Ma bin (see Methods). d, The 800-kyr ice core δ¹⁸O_atm record?” (brown line)
and binned values of Allan Hills δ¹⁸O_atm samples (brown circles). All δ¹⁸O_atm values
are normalized to the modern atmosphere. e, The globally distributed benthic
oxygen isotope stack over the Plio-Pleistocene (LR04)” showing decreased
glacial–interglacial variability and less extreme glacials before 800 ka.


preserved in the Allan Hills ice, and that 84 ± 15% (2σ) of CO₂ variability
within the ice is captured by discrete samples, assuming a linear relationship
between CO₂ and benthic δ¹⁸O (see Methods).
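
The overall recovery estimate can be checked with simple arithmetic. The sketch below assumes that the two fractions combine as a plain product and that only the sampling fraction carries an uncertainty; both are assumptions made here for illustration, and the authors' own calculation (see Methods) may differ. The function name is hypothetical.

```python
def combined_recovery(fraction_preserved, fraction_sampled, sampled_uncertainty):
    """Combine two recovery fractions into an overall fraction.

    fraction_preserved: share of true climate variability preserved in the ice
    fraction_sampled: share of in-ice variability captured by discrete samples
    sampled_uncertainty: uncertainty on fraction_sampled (same confidence level)

    Assumes a plain product with no uncertainty on fraction_preserved.
    """
    central = fraction_preserved * fraction_sampled
    uncertainty = fraction_preserved * sampled_uncertainty
    return central, uncertainty


# Values quoted in the text: 90% preserved, 84 +/- 15% sampled.
central, uncertainty = combined_recovery(0.90, 0.84, 0.15)
```

Under these assumptions the product is 0.756 with an uncertainty of 0.135, which agrees with the quoted 76 ± 14% to within rounding.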

Measured climate proxies and ice core properties across time (Fig. 2) 
include proxy records of regional temperature (δD_ice), CH₄, CO₂, and global
O₂ fractionation (δ¹⁸O_atm)”. Samples older than 800 kyr are assigned the
age of the nearest ⁴⁰Ar_atm-dated sample and then binned into two age
groups: the MPT (800-1,200 kyr), and the 40k world (1.2-2.7 Myr). Within 
the 40k world bin, samples are further subdivided into three groups with 
average ages of 1.5, 2.0, and 2.7 Myr (see Methods and Fig. 2). 

Improbably high CO₂ concentrations or δ¹³C values of CO₂ indicating
contributions from respiratory CO₂ were present in all 2.7-Myr-old
samples, common among 2.0-Myr-old samples, but absent from 1.5-Myr-old
samples (see Methods). We chose −7‰ as the cut-off value for δ¹³C
and rejected all CO₂ samples below the shallowest δ¹³C sample with an
isotopic value lighter than −7‰. The application of these criteria
excluded 21 CO₂ samples from our analysis. CH₄ and δ¹⁸O_atm samples
were also rejected when δ¹³C or CO₂ measurements indicated that
these properties were compromised. We continued to plot δD_ice values
of the ice affected by respiration in Fig. 2 in the 2.0 and 2.7 Myr age
groups, but excluded δD_ice data from anomalous CO₂ samples in the
40k world.

Fig. 3 | Comparison of blue-ice CO₂ record and boron-based CO₂
reconstructions during and before the MPT. a, LR04 benthic oxygen isotope
stack?. b, Distribution of MPT CO₂ (left, orange) and pre-MPT CO₂ (right, red)
data observed in Allan Hills ice. c, Comparison between ice core CO₂ (orange and
red circles) and δ¹¹B-based CO₂ reconstructions (purple and blue solid lines;


Ice dating back to about 100–250 ka, sampled at a nearby site’®, records
a δD_ice range of −280 to −360‰ in the 100k world. δD_ice values ranged
from −284 to −316‰ in 40k-world ice and from −288 to −332‰ in MPT ice.
Maximum values of δD_ice were similar during all three time periods, lying
within an 8‰ range. However, minimum values were higher in the MPT
and the 40k world by at least 28‰. This implies that interglacial periods
during the 40k world in Antarctica were not significantly warmer than
those in the 100k world, but that glacial maxima became colder after the MPT.

CO₂ concentrations ranged from 221 to 277 ppm (n = 23)” in MPT ice,
and from 214 to 279 ppm (n = 37) in ice from the 40k world. Given that
76 ± 14% (2σ) of total CO₂ variability was captured by 37 discrete samples,
and assuming that glacial and interglacial extremes are equally absent,
the expected true range of 40k-world CO₂ is 86 ± 13 ppm (2σ; Fig. 3). Our
best estimate of 40k-world CO₂ concentrations is therefore 204–289 ppm.
These ranges indicate that interglacial CO₂ concentrations in the 40k and
100k worlds were similar, whereas glacial CO₂ concentrations are likely
to have been 24 ppm higher in the 40k world than in the 100k world. By
comparison, reconstructions of atmospheric CO₂ from individual boron
isotope measurements in foraminifera ranged from 190 to 320 ppm
(Fig. 3), with the average extreme values being 234 and 277 ppm”.
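
The range expansion in this paragraph can be reproduced in a few lines. This is a minimal sketch of the central estimate only, assuming (as the text does) that the missing variability is split equally between the glacial and interglacial ends; the function name is illustrative, and the quoted ±13 ppm uncertainty is not derived here.

```python
def expand_observed_range(observed_min, observed_max, fraction_captured):
    """Scale an observed range up to the implied true range.

    The missing part of the range is split equally between the low (glacial)
    and high (interglacial) ends, following the symmetric assumption in the text.
    Returns (expanded_min, expanded_max, true_range).
    """
    observed_range = observed_max - observed_min
    true_range = observed_range / fraction_captured
    padding = (true_range - observed_range) / 2.0
    return observed_min - padding, observed_max + padding, true_range


# 40k-world CO2 samples: 214 to 279 ppm, with 76% of variability captured.
co2_low, co2_high, co2_range = expand_observed_range(214.0, 279.0, 0.76)
```

With these inputs the true range comes out near 86 ppm and the expanded bounds round to 204 and 289 ppm, matching the best estimate stated above.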


Implications for climate evolution 


Our findings appear to be inconsistent with hypotheses that attribute
the transition into the 100k world to a long-term decline in both
interglacial and glacial atmospheric CO₂””. Our data instead support
hypotheses that link the MPT to greater ice sheet size and CO₂ drawdown
during glacial maxima. Those hypotheses include, but are not limited to,
enhanced dust delivery to the Southern Ocean” and changes in global
ocean circulation® that resulted in an additional drawdown of atmospheric
CO₂, thereby enhancing the growth of continental ice sheets and
leading to the emergence of 100-kyr glacial cycles.

Compared to CO₂ concentrations and δD_ice, CH₄ concentrations and
δ¹⁸O_atm values in ice from the MPT and 40k world exhibited even less


dashed lines represent the 95% confidence intervals)”. Shading represents the
expanded range estimate, given the possibility that discrete Allan Hills CO₂
samples may not fully capture the true range of CO₂ variability. Black dashed
lines represent the glacial–interglacial range of CO₂ in the 100k world’.


glacial–interglacial variability. CH₄ concentrations ranged from 405
to 569 parts per billion (ppb) in MPT ice (n = 31) and 457 to 592 ppb in
40k-world ice (n = 16). These ranges are only about 40% of the range
in the 100k world (approximately 350 to 800 ppb)”. Undersampling
may play a role in this reduction, as CH₄ maxima in the 100k world have
short durations”®. In addition, discrete CH₄ measurements require
more ice than CO₂ measurements (60 g versus 10 g). They therefore
average over a longer interval of time than CO₂ samples. δ¹⁸O_atm values
ranged from 0.01‰ to 0.92‰ in MPT ice (n = 63) and from 0.00‰
to 0.33‰ in 40k-world ice (n = 29). In comparison, the range is from
−0.30‰ to +1.48‰ in ice from the 100k world”. The δ¹⁸O_atm values in
ice from the 40k world are remarkable in that they lack values higher
than +0.5‰ and the total range of δ¹⁸O_atm values is only about 20% of
that observed in ice from the 100k world. Smaller changes in seawater
δ¹⁸O values (Fig. 2) help to explain the absence of δ¹⁸O_atm values higher
than +0.5‰. However, additional mechanisms are likely to be needed
to explain the lack of precession-modulated variability in δ¹⁸O_atm associated
with biosphere productivity and changes in the low-latitude water
cycle”.

Plots of atmospheric CO₂ against δD_ice and CH₄ against δD_ice (Fig. 4)
show that samples from the MPT and 40k world fall within the envelope
of the 100k world but exhibit only a fraction of its range. Samples from
the MPT and 40k world do not have the isotopically light δD_ice and low
CO₂ values characteristic of glacial samples in the 100k world. Both CO₂
and CH₄ are positively correlated with δD_ice in ice from the 100k world
(S27). By contrast, in the MPT and 40k world, only δD_ice and CO₂ are
significantly correlated (P = 0.002 and P = 0.003, respectively). For all
three sample sets, the slopes of the correlation between CO₂ and δD_ice
are statistically indistinguishable (Fig. 4), in agreement with previous
reconstructions from boron-based proxies and benthic δ¹⁸O values.
Together, these observations suggest a consistent and persistent coupling
between Antarctic climate and atmospheric CO₂ throughout the
Pleistocene®”’, and that glacial cycles in the 40k world are truncated
versions of glacial cycles in the 100k world.

Fig. 4 | Cross-correlations between climate properties. a, CO₂ versus δD_ice.
b, CH₄ versus δD_ice. Data points are colour-coded by age: black, Vostok CO₂ or
CH₄ data? projected onto coeval Allan Hills δD_ice from S27; orange, Allan Hills
MPT data; red, Allan Hills 40k-world data. Shading represents the range of δD_ice
we observed in the 40k world. δD_ice measurements in samples measured for CO₂
span 94% of the total range of δD_ice observed in 40k-world samples, and therefore
the CO₂ samples represent nearly the full range of climate variability recorded in
the cores. There are significant correlations between CO₂ and δD_ice in all three
age units, and their regression slopes (ppm per ‰; solid lines) are statistically
indistinguishable: 1.33 ± 0.17 (2σ; r = 0.78), 1.14 ± 0.68 (r = 0.61), and 0.90 ± 0.56
(r = 0.48) for 120–250 ka, MPT, and pre-MPT age units, respectively. For
comparison, a significant (P < 0.05) correlation between CH₄ and δD_ice exists
only at S27 (3.56 ± 0.65; r = 0.61).


Conclusions 


Shallow ice cores drilled in the Allan Hills BIA provide a direct observation
of the variability in atmospheric CO₂ during the 40k world, confirming
previous findings that minimum CO₂ concentrations declined after the
MPT®”. These results complement plans to drill a continuous ice core
record back to around 1.5 Ma”. These snapshots of the Earth’s climate
system from shallow ice cores in BIAs provide an incomplete picture,
especially with regard to the dynamics. Nevertheless, the oldest ice will
probably be found in discontinuous sections. Our work demonstrates
that BIAs can be exploited to extend climate records, including atmospheric
CO₂ concentrations, well back into the early Pleistocene and
perhaps even the Pliocene.

Methods


Sample location and description 

Allan Hills Blue Ice Area (BIA). The Allan Hills BIA is located ~100 km
to the northwest of the McMurdo Dry Valleys. Drilling sites are located 
to the west of the Allan Hills nunatak in the Main Ice Field (MIF) of the 
Allan Hills BIAs (Extended Data Fig. 1). Here, net ablation leads to the 
exposure of ancient glacial ice at the surface of an ice sheet”. Elevation, 
bedrock topography, and ice velocities in the MIF have been previously 
documented’. Site meteorology and surface snow properties are 
discussed in more detail by Dadic et al.*, and ice and gas properties are
described by Higgins et al.®.


Site ALHIC1503 (previously named BIT-58). Site ALHIC1503 
(76.73243° S, 159.3562° E) is located near a crest of a northwest–southeast
trending ice ridge off the MIF. The precise direction and the velocity of ice
flow are not known at this location. Measurements from the nearby ice 
field are consistent with primary flow to the east or northeast. Complex 
dust bands are visible from high-resolution satellite imagery (Extended 
Data Fig. 1, left), implying a complex history of glacial flow. Typical ice 
flow velocities near ALHIC1503 are low (0.015 m yr⁻¹).

Bedrock topography, determined using ground penetrating radar 
(GPR), indicates that ALHIC1503 is located on a steep (~45°) slope leading
to a local bedrock high with an overlying ice thickness of ~150 m
(Extended Data Fig. 2). In the 2015-2016 field season, 23 m of ice was 
drilled, extending the earlier 124-m-long ice core” to the bedrock at 
147 m below the ice surface. 


Site ALHIC1502. Site ALHIC1502 (76.73286° S, 159.35507° E) is located 56 m
upflow from ALHIC1503. The ice thickness is ~204 m, consistent with a
steep bedrock slope (Extended Data Fig. 2). Drilling at site ALHIC1502
recovered 197 m of ice.


Site 27 (S27). S27 (76.70° S, 159.31° E) is located along the main ice flow
line of the Allan Hills region. Two hundred and twenty-four metres of ice 
was retrieved, representing ~70% of the estimated column thickness. 
Spaulding et al.'® provide a comprehensive suite of stable water isotope 
and gas analyses for S27. The S27 ice chronology has been established by 
matching δD_ice features to those of the Vostok ice core. This timescale
was validated by correlating variations in δ¹⁸O_atm (δ¹⁸O of paleoatmospheric
O₂) to the record of δ¹⁸O_atm in Vostok ice cores®.


Analytical procedures 

⁴⁰Ar_atm and δXe/Kr. ⁴⁰Ar is produced in the solid earth by the radioactive
decay of ⁴⁰K. As ⁴⁰Ar slowly leaks into the atmosphere, its concentration
increases with time. ³⁸Ar and ³⁶Ar, on the other hand, are stable, primordial,
and non-radiogenic, so their atmospheric concentrations are treated as
constant. The ratio of ⁴⁰Ar/³⁸Ar thus rises towards the future and decreases
towards the past. The term of merit here is the gravitationally corrected
paleo-atmospheric ⁴⁰Ar/³⁸Ar ratio:
$^{40}\mathrm{Ar}_{\mathrm{atm}} = \delta^{40}\mathrm{Ar}/^{38}\mathrm{Ar} - \delta^{38}\mathrm{Ar}/^{36}\mathrm{Ar}$.

The latter term corrects for the gravitational fractionation in the firn
column®. Studies of ⁴⁰Ar_atm as a function of age in the Dome C and Vostok
ice cores constrained the rate of change to be 0.066 ± 0.006‰ Myr⁻¹,
allowing us to date the air trapped in the ice without the prerequisite
of stratigraphic continuity”.

A potential complication of ⁴⁰Ar_atm dating comes from the in situ
production of ⁴⁰Ar from the bedrock and/or from dust, a tiny
amount of which is present in polar ice cores. The deepest sample
at ALHIC1502 has a much younger age (0.8 Myr) than the overlying
ice (2.2 Myr). This inverted age–depth relationship may result either
from folding or from radiogenic input of ⁴⁰Ar from the ⁴⁰K in the underlying
bedrock, a phenomenon observed for basal ice at GISP2 in Greenland”.
The ⁴⁰K from dust in the ice would produce a negligible amount of ⁴⁰Ar.

Ar isotope analyses on the original BIT-58 core between the surface
and 126-m depth were carried out at Princeton University with a procedure
modified from Bender et al.” and described by Higgins et al.>. The
standard deviation of ⁴⁰Ar_atm measured across 68 external standards
is ±0.014‰, corresponding to an age uncertainty of ±210 kyr (1σ). For
Allan Hills samples drilled in 2015–2016, ⁴⁰Ar_atm was measured at Scripps
Institution of Oceanography with slight modifications. First, sample size
was increased from 500 to 800 g for better precision. Second, pumping
time for evacuating the ice-bearing vessel before gas extraction was
reduced to 15 min to prevent gas loss. Third, extracted and purified gases
were measured on a newer mass spectrometer, which offers better signal
sensitivity and stability. Finally, following published procedures**, we
measured the Xe/Kr ratios, expressed as δXe/Kr, an indicator of mean
ocean temperature”, in the same aliquot of the extracted and gettered
gases. A typical ⁴⁰Ar_atm and δXe/Kr measurement takes ~3.5 h on the
mass spectrometer.

Two measurement campaigns at Scripps Institution of Oceanography
were carried out, each associated with slightly different analytical
procedures. In the first campaign, a fixed amount of reference
gas was introduced into the bellows of the mass spectrometer, leaving
different amounts of gas in the sample and reference sides. As the sample
and reference gases deplete during analysis, the pressure and ion
current fall more rapidly for the side containing less gas. As a result,
⁴⁰Ar_atm was observed to depend on the volume of the analyte relative to
the reference. All measured ⁴⁰Ar_atm values were subsequently corrected for
volume differences, using the voltage readings of the mass-40 (⁴⁰Ar)
beam when the gases were introduced to the fully expanded bellows on
the mass spectrometer. The magnitude of this correction is generally less
than 0.02‰, or 300 kyr, and the uncertainty in the correction is much
smaller. In the second campaign, the amount of gas in the standard-side
bellows was matched to that of the reference-side bellows. No volume
correction was applied in the second campaign. Based on replicate
analyses, the external reproducibility was ±0.006‰ (1σ) and ±0.007‰
(1σ) for dry La Jolla air samples and Taylor Glacier blue-ice samples,
respectively. The samples measured at Scripps therefore have an age
uncertainty of ±110 kyr (1σ) owing to the analytical uncertainty, or 10%
of the sample age owing to the uncertainty in the ⁴⁰Ar outgassing rate,
whichever is greater. Individual sample uncertainties are reported along
with the measured values.
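
The conversion from analytical precision to age uncertainty described above can be sketched as follows. This is a minimal illustration, not the authors' processing code: the function and constant names are hypothetical, and the only inputs taken from the text are the 0.066‰ Myr⁻¹ rate of change and the rule that the reported uncertainty is the greater of the analytical term and 10% of the sample age.

```python
# Rate of change of 40Ar_atm quoted in the text (permil per Myr).
RATE_PERMIL_PER_MYR = 0.066


def analytical_age_sigma_kyr(sigma_permil, rate=RATE_PERMIL_PER_MYR):
    """Convert a 1-sigma isotope precision (permil) into an age uncertainty (kyr)."""
    return sigma_permil / rate * 1000.0


def age_sigma_kyr(age_kyr, sigma_permil, rate_fraction=0.10):
    """Return the larger of the analytical term and the outgassing-rate term.

    rate_fraction is the 10% of sample age attributed in the text to the
    uncertainty in the outgassing rate.
    """
    return max(analytical_age_sigma_kyr(sigma_permil), rate_fraction * age_kyr)
```

Under these assumptions, a precision of 0.014‰ maps to roughly 210 kyr and 0.007‰ to roughly 110 kyr, and for samples older than about 1.1 Myr the 10% term becomes the larger of the two.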

Xenon-to-krypton ratios (δXe/Kr) were measured immediately after
the ⁴⁰Ar_atm measurements on the same mass spectrometer by 'peak
hopping' between Kr and Xe. The δXe/Kr measurement takes about
half an hour to complete. The final reproducibility is ±0.21‰ and ±0.27‰
for dry La Jolla air samples and Taylor Glacier ice replicates, respectively.
In order to calculate mean ocean temperature from δXe/Kr, which is
normalized to La Jolla air, we need to make an assumption about the volume
of the ocean. For this purpose, we use a sensitivity of 1.1 ± 0.2 °C/‰. We
obtain this value from the observed last glacial maximum (LGM)-present
δXe/Kr change with minor modifications. We adopted all the same
sea-level-related corrections as in that study except one: the assumption
that the Kr and Xe saturation state was higher during the LGM than at
present. This assumption lowers the sensitivity of mean ocean temperature
(MOT) to δXe/Kr from 1.2 to 1.0 °C/‰ (for example, a MOT change of
2.57 ± 0.24 °C for a 2.5‰ change of δXe/Kr). To account for this
uncertainty we adopt an intermediate value of 1.1 and a larger error of ±0.2.
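
As a rough illustration of how the adopted sensitivity translates a δXe/Kr change into a mean ocean temperature change, the sketch below applies the 1.1 ± 0.2 °C/‰ factor. The function name is hypothetical, and the first-order propagation of independent errors is an assumption of this sketch, not a procedure stated in the text.

```python
import math

SENSITIVITY_C_PER_PERMIL = 1.1   # adopted sensitivity from the text
SENSITIVITY_ERR = 0.2            # adopted uncertainty from the text


def mot_change(d_xe_kr_permil, d_err_permil=0.0):
    """Return (MOT change in deg C, 1-sigma error) for a dXe/Kr change in permil.

    Errors on the sensitivity and on dXe/Kr are treated as independent and
    combined in quadrature (an assumption of this sketch).
    """
    mot = SENSITIVITY_C_PER_PERMIL * d_xe_kr_permil
    err = math.sqrt((SENSITIVITY_ERR * d_xe_kr_permil) ** 2
                    + (SENSITIVITY_C_PER_PERMIL * d_err_permil) ** 2)
    return mot, err
```

For a 2.5‰ change this gives about 2.75 °C, between the values implied by the two end-member sensitivities of 1.0 and 1.2 °C/‰.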


δD_ice and δ¹⁸O_ice. Stable water isotope measurements (δ¹⁸O_ice, δD_ice) were
taken from a slab of ice, 2.5-cm wide, cut (parallel to the vertical axis)
from the outside of each ice core. Because some portions of the core
were fractured, continuous sections were not available at all depths.
These slabs were sub-sampled at 10-15-cm resolution at the Climate
Change Institute, University of Maine. Each sub-sample was melted in
a sealed plastic bag at room temperature, vigorously shaken, and then
decanted into an 18-ml plastic scintillation vial. All vials were refrozen
and stored below −10 °C until the time of analysis. Water isotope samples
were measured via cavity ring-down spectroscopy (CRDS) using a Picarro
Model L2130-i Ultra High-Precision Isotopic Water Analyzer coupled
with a High Precision Vaporizer and liquid auto sampler module. δ¹⁸O_ice
and δD_ice were measured simultaneously with an internal precision (2σ)
of ±0.05‰ for δ¹⁸O_ice and ±0.10‰ for δD_ice. The instrument drift
adjustment and daily calibration were accomplished by periodically measuring
three secondary standards: BBB (an average Maine freshwater from the
Bear Brook watershed), ASS (Antarctic surface snow) and LAP (light
Antarctic precipitation). These standards were calibrated against the
internationally accepted water isotope standards: SMOW (standard
mean ocean water), SLAP (standard light Antarctic precipitation) and
GISP (Greenland ice sheet precipitation).


δ¹⁵N, δ¹⁸O_atm and δO₂/N₂. Analyses for the original BIT-58 O₂/N₂/Ar
elemental ratio, δ¹⁸O of O₂, and δ¹⁵N of N₂ were carried out as described.
Samples from ALHIC1502 and ALHIC1503 were measured using a slightly
modified procedure. In brief, ~20 g of ice was placed into a glass flask
chilled in a dry-ice-isopropanol cold bath (−78.5 °C) and the vessel was
turbo-pumped for 3 min. The final pressure of the residual gas inside the
vial was no higher than 1.5 mTorr. The pressure of the air trapped in ice,
when expanded to the same space, was ~3 Torr. The reduced pumping
time thus introduces, at most, about 0.05% of modern atmosphere into
the sample gas.
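
The 0.05% upper bound follows from the ratio of the residual pressure to the sample pressure in the same volume. A short check, with a hypothetical function name and the two pressures taken from the text:

```python
def residual_air_fraction(residual_mtorr, sample_torr):
    """Upper bound on the fraction of modern air left in the vessel.

    Both pressures refer to the same volume, so the ratio of pressures
    equals the ratio of gas amounts (ideal-gas assumption).
    """
    return (residual_mtorr / 1000.0) / sample_torr


fraction = residual_air_fraction(residual_mtorr=1.5, sample_torr=3.0)
```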

After melting, the gas-meltwater mixture was equilibrated for at
least 4 h. The majority of the meltwater was drained and leftover water
inside the glass vial refrozen at −30 °C. Once all meltwater had refrozen,
headspace gases were collected by condensation at liquid helium
temperatures. The sample was admitted to a mass spectrometer (Thermo
Finnigan Delta Plus XP) for elemental and isotope ratio analysis after
homogenizing at room temperature for 30 min. The reproducibility of
δ¹⁵N, δO₂/N₂ and δ¹⁸O_atm values from ALHIC1502 and ALHIC1503 samples
analysed using this procedure, calculated as pooled standard deviation,
is ±0.008‰, ±3.164‰, and ±0.024‰, respectively.

Overall, the reduced pumping time leads to a reduction in gas loss
and an improvement in precision. We note that the gas loss processes
that alter δO₂/N₂ did not appear to affect the isotopic composition of
O₂ and N₂ in our samples (Extended Data Fig. 3). As a result, no gas loss
correction was applied to the gas isotopes, in contrast to some previous
studies in heavily fractured ice (for example, Siple Dome). All δ¹⁵N,
δO₂/N₂ and δ¹⁸O_atm values were normalized to modern atmosphere.


CH₄, CO₂ and δ¹³C of CO₂. CH₄ concentrations were analysed at Oregon
State University using a melt-refreeze technique. Samples (~60-70 g of
ice) were trimmed, melted under vacuum, and refrozen at about −70 °C.
CH₄ concentrations in released air were measured by gas chromatography
and referenced to air standards calibrated by National Oceanic
and Atmospheric Administration Global Monitoring Division (NOAA
GMD) on the NOAA04 scale. Precision was generally better than ±4 ppb.
Individual sample uncertainties, where available, are reported along
with the measured values.

CO₂ concentrations, referenced to air standards calibrated by NOAA
GMD on the World Meteorological Organization (WMO) scale, were
measured using a dry extraction (crushing) method. Wherever possible,
ice samples were processed and analysed in replicate for each
depth and results averaged to obtain final CO₂ concentrations. However,
four pairs of replicates did not come from exactly the same depth and
their reproducibility showed substantial deterioration beyond usual
analytical uncertainties. We therefore treated those four pairs of
replicates as eight individual, unreplicated data points. For greenhouse gas
concentrations, no gravitational fractionation correction was applied;
the correction would be less than 0.8 ppb for CH₄ and 0.5 ppm for CO₂.

The measurement protocol for δ¹³C of CO₂ was as described, with
improvements in the efficiency of air extraction. Approximately 200 g of ice
was dry-extracted with a custom-made 'ice grater'. The reproducibility of
the method, based on replicate analysis of Taylor Glacier blue-ice samples,
is ±0.02‰ for δ¹³C-CO₂. The method also provides a measurement of CO₂
concentrations. The uncertainties are ±2 ppm (1σ of the pooled standard
deviation). The final δ¹³C values, normalized to the Vienna Pee Dee
Belemnite (VPDB) standard, were corrected for blanks, N₂O⁺ mass interference
from N₂O, and gravitational fractionation using the nearest δ¹⁵N value.


Age assignment of CO₂/CH₄ and O₂/N₂/Ar samples

The depths of samples analysed for CO₂/CH₄ and O₂/N₂/Ar were different
from the depths of the ⁴⁰Ar_atm samples that provided the chronology.
CO₂/CH₄ samples, O₂/N₂/Ar samples, and ice samples were assigned ages by
bracketing ⁴⁰Ar_atm values. Here we discuss two different models for
assigning ages and show that different criteria for age assignment lead to
similar conclusions. For the purpose of investigating the relationship between
greenhouse gases (CO₂ and CH₄) and Antarctic climate (δD_ice), we binned
samples into three units: MPT, 800-1,200 ka; pre-MPT, >1,200 ka; and young
ice, <800 ka. We focus on the robustness of ages for CO₂ and CH₄ samples. All
CO₂ and CH₄ data discussed below exclude samples affected by respiration.


Conservative approach. In the most conservative approach, an age
was assigned only to samples bracketed by two ⁴⁰Ar_atm ages lying within
a single age bin. Otherwise, the sample was left 'unassigned'. In this
binning scenario, 19 CO₂ samples were classified as MPT, 15 as pre-MPT,
and 34 as unassigned. We classified 28 and 19 CH₄ samples as MPT and
unassigned, respectively. Eight CH₄ samples were binned into pre-MPT.

Proximity approach. In this approach, each CO₂/CH₄ datum was
assigned the ⁴⁰Ar_atm age of the closest Ar isotope sample. In this age
model, 23 and 37 CO₂ samples were binned into the MPT and pre-MPT
units, respectively. For CH₄, 31 samples were classified as MPT, and 16
samples as pre-MPT. Finally, eight CO₂ and eight CH₄ samples were
binned into the younger (<800 ka) age unit.

Extended Data Fig. 4 summarizes the CO₂-δD_ice and CH₄-δD_ice
relationships using the two age models. Our conclusions regarding the
greenhouse gas-δD_ice relationship are not dependent on which model
is used. As the proximity approach assigns an age to every depth, we
adopted it for our interpretations.
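
The two age models can be sketched in a few lines. This is a hypothetical illustration under stated assumptions: the function names are invented here, the ⁴⁰Ar_atm depths are assumed to be sorted in increasing order, a sample lying outside the dated depth range is treated as unassigned in the conservative model, and an age of exactly 1,200 ka is placed in the MPT bin.

```python
import bisect


def age_bin(age_ka):
    """Bin an age (ka) into the three units used in the text."""
    if age_ka < 800.0:
        return "young"
    if age_ka <= 1200.0:
        return "MPT"
    return "pre-MPT"


def conservative_bin(depth, ar_depths, ar_ages):
    """Assign a bin only if the bracketing Ar ages fall in the same bin."""
    above = bisect.bisect_right(ar_depths, depth) - 1   # last Ar sample at or above
    below = bisect.bisect_left(ar_depths, depth)        # first Ar sample at or below
    if above < 0 or below >= len(ar_depths):
        return "unassigned"
    bin_above = age_bin(ar_ages[above])
    bin_below = age_bin(ar_ages[below])
    return bin_above if bin_above == bin_below else "unassigned"


def proximity_age(depth, ar_depths, ar_ages):
    """Return the age of the Ar sample closest in depth."""
    nearest = min(range(len(ar_depths)), key=lambda i: abs(ar_depths[i] - depth))
    return ar_ages[nearest]
```

The conservative model leaves a sample unassigned whenever it straddles a bin boundary, whereas the proximity model always returns an age, which is why only the latter covers every depth.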


Assessing the fraction of climate variability recovered by 
discrete sampling 

When interpreting data from stratigraphically disturbed ice, the ques- 
tion arises as to what fraction of the amplitude of climate variability is 
accessed in the suite of samples analysed. Here we consider two factors: 
(1) the fraction of climate variability preserved in the ice cores compared 
to the true natural variability; and (2) to what extent discrete samples 
capture the variability that is preserved in the ice. Below we discuss 
these two factors separately. 


Fraction of the pre-MPT δD_ice range preserved in the ice. We start
by considering what fraction of the true glacial-interglacial variability
in the 40k world is preserved in the ice. If we assume the absence of
post-depositional alteration processes such as respiration and diffusive
smoothing, this fraction should be an intrinsic property of the ice itself
and insensitive to the property being measured. Given the large number
(>500) and the high spatial resolution (10 cm) of δD_ice measurements,
we used δD_ice to evaluate the fraction of true δD_ice variability preserved
in the ice. Next, we needed to make an assumption of the true δD_ice
amplitude in the 40k world. It is assumed that the true amplitude of δD_ice
variations in Allan Hills ice scales linearly with the amplitude of deep
ocean temperature. This assumption is justified for two reasons. First,
most of the bottom water formation today occurs in the Southern Ocean.
Second, Antarctic temperature has been found to correlate tightly with
mean ocean temperature during the LGM-Holocene transition and
with South Pacific bottom water temperature through the past 800 kyr.

For deep ocean temperature, we considered two scenarios. First, we
assumed a constant partitioning between ocean temperature and the
influence of seawater δ¹⁸O on the benthic foram δ¹⁸O record throughout
the Pleistocene. This means that the amplitude of δD_ice should
simply scale with the benthic foram δ¹⁸O values. The amplitudes of
glacial cycles in the early and mid-Pleistocene (between marine isotope
stages (MIS) 39 and 104), expressed in benthic foram δ¹⁸O, average
0.70‰, representing 37% of the change from MIS 6 to MIS 5e (1.90‰).
The range of δD_ice values between MIS 6 and MIS 5e observed in the Allan
Hills ice from the 100k world was 80‰. Multiplying this 100k-world range
by the scaling factor (37%) based on benthic foram δ¹⁸O predicts the
range of δD_ice in 40k-world ice to be 30‰. By comparison, the range of
40k-world δD_ice observed in the Allan Hills ice is 32‰. Using this approach,
we estimated that our pre-MPT ice samples represent ~100% of the
average amplitude of glacial-interglacial δD_ice cycles in the 40k world.

In an alternative scenario, we considered deep ocean temperature
inferred from a seawater δ¹⁸O record constrained independently from
records of carbonate microfossil δ¹⁸O from the Mediterranean Sea.
A slightly higher variability in the 40k world deep ocean temperature,
which corresponds to 50% of the average variability in the 100k world,
lowers our estimate of the recovered climate variability to ~80%.
Consequently, although other factors might influence the recovery of the range
of climate variability, given these two approaches our best estimate is
that the Allan Hills ice samples presented here preserve 90% of the true
range of the climate variables in the 40k world.
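
The arithmetic behind the two scenarios can be reproduced as below. The function name is hypothetical, and two steps are this sketch's reading of the text, not stated procedures: capping the preserved fraction at 100%, and taking the 90% best estimate as the midpoint of the two scenarios.

```python
def preserved_fraction(observed_range, reference_range, scale):
    """Observed 40k-world range divided by the range predicted from scaling.

    The result is capped at 1.0 (an assumption: more than the full range
    cannot be preserved).
    """
    predicted = reference_range * scale
    return min(1.0, observed_range / predicted)


# Scenario 1: scale by benthic d18O amplitudes (0.70 permil / 1.90 permil, ~37%).
f_benthic = preserved_fraction(32.0, 80.0, 0.70 / 1.90)

# Scenario 2: 40k-world deep ocean temperature variability is 50% of 100k world.
f_seawater = preserved_fraction(32.0, 80.0, 0.50)

# Midpoint of the two scenarios.
best_estimate = 0.5 * (f_benthic + f_seawater)
```

With the quoted inputs the first scenario predicts a 40k-world range of about 30‰ (so the observed 32‰ corresponds to roughly 100%), the second predicts 40‰ (80%), and their midpoint is 90%.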


Fraction of the pre-MPT CO₂ and CH₄ range recovered by discrete
samples. CO₂ and CH₄ concentrations were measured on a relatively
small number of samples. It is therefore possible that we have not
retrieved the full range of variability of these properties in the 40k world,
even if their true variability is well preserved in the ice cores. We evaluated
this possibility by first generating a synthetic CO₂ series based upon the
assumption that the variation of CO₂ in the 40k world scales linearly with
benthic foram δ¹⁸O values. A synthetic CO₂ time series between 1.2 and 2.0
Ma was constructed on the basis of the LR04 global benthic foram δ¹⁸O
stack. A linear regression between benthic δ¹⁸O and atmospheric CO₂
between 0 and 800 ka resulted in correlation parameters that were used to
create an artificial atmospheric CO₂ record further back in time (Extended
Data Fig. 5a, b). After resampling to interpolate the synthetic data at a
temporal resolution of 2.5 kyr, a Monte Carlo simulation randomly selected
37 samples from the synthetic CO₂ record. The simulation was repeated
10,000 times. We then calculated the observed range of the 37 random
CO₂ samples and compared it to the 'true' range, yielding a ratio for each
simulation run. Finally, the distribution of this ratio is shown as a series of
histograms (Extended Data Fig. 5c-h). Note that the figure of merit for each
simulation is the ratio of the sampled range to the hypothetical true range.

To account for the fact that interglacial accumulation rates are generally
higher than glacial rates, we multiplied the modelled occurrence of
interglacial ice (CO₂ >250 ppm) by 2, 4, 8, 16, and 32. When the presence
of the interglacial ice is two times the glacial ice, 84 ± 15% (95% confidence
interval) of the total CO₂ range can be captured by 37 samples (Extended
Data Fig. 5d). This factor of two was derived from Vostok, an inland coring
site of East Antarctica.
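
The resampling test can be illustrated with a self-contained sketch. Several elements are assumptions made only for illustration: a 41-kyr sinusoid stands in for the LR04-based synthetic CO₂ record (which is not reproduced here), the mean and amplitude are arbitrary, and samples are drawn with replacement because the text does not specify the sampling scheme. All function names are hypothetical.

```python
import math
import random


def synthetic_co2(start_ka=1200.0, end_ka=2000.0, step_kyr=2.5,
                  period_kyr=41.0, mean_ppm=240.0, amp_ppm=30.0):
    """Stand-in CO2 series: a sinusoid sampled every 2.5 kyr (illustrative only)."""
    n = int(round((end_ka - start_ka) / step_kyr)) + 1
    return [mean_ppm + amp_ppm * math.sin(2.0 * math.pi * (start_ka + i * step_kyr) / period_kyr)
            for i in range(n)]


def range_ratio_distribution(series, n_samples=37, n_runs=10000,
                             interglacial_weight=2.0, threshold_ppm=250.0, seed=0):
    """Monte Carlo distribution of sampled range divided by true range.

    Points above threshold_ppm are treated as interglacial and weighted by
    interglacial_weight to mimic their higher accumulation rate.
    """
    rng = random.Random(seed)
    true_range = max(series) - min(series)
    weights = [interglacial_weight if x > threshold_ppm else 1.0 for x in series]
    ratios = []
    for _ in range(n_runs):
        picks = rng.choices(series, weights=weights, k=n_samples)
        ratios.append((max(picks) - min(picks)) / true_range)
    return ratios
```

Repeating the call with `interglacial_weight` set to 2, 4, 8, 16 and 32 would mirror the sensitivity test described above; the resulting ratios depend on the stand-in series and should not be compared numerically with the published 84 ± 15%.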

We note that 35 out of the 37 40k-world CO, samples involve replicates 
measured at the same depth. The assumption here is that replicates at 
the same depth have exactly the same age, which remains unevaluated 
in Allan Hills ice cores. It is possible that the ice flow and dip of ice layers
make samples from the same depth different in age. If this is the case, 
each pair of replicates should be treated as two individual measure- 
ments, and the total number of samples would be 72. In this scenario, the 
recovered range is from 214 to 281 ppm, and 72 discrete samples capture 
88 + 13% of the CO, variability preserved in the ice. Treating replicates 
as individual samples does not significantly change the range (from 65 
to 67 ppm), but it increases the likelihood that we are recovering close 
to the full range of CO, in the ice (from 84% to 88%). 

Considering the discussion above, our best estimate is that the 37
CO₂ samples cover 76 ± 14% of the true CO₂ range in the 40k world
(90% of true variability preserved in the ice multiplied by 84 ± 15% that is
captured by discrete sampling). Furthermore, we note that the δD_ice
values of the ice samples analysed for CO₂ as well as CH₄ span more than
90% of the observed 40k-world δD_ice range (Fig. 4). This number is higher
than the estimate yielded by Monte Carlo simulations, meaning that the
discrete samples are likely to capture more variability than probability
alone would dictate. As a result, there is a greater chance that we have
captured a significant fraction (>90%) of the glacial-interglacial CO₂
range in the 40k world.
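
Written out, the combined estimate is the product of the preserved fraction and the sampled fraction. Scaling the sampling uncertainty by the same factor is an assumption of this note; it reproduces the quoted ±14% to within rounding.

```latex
\[
  0.90 \times (0.84 \pm 0.15) \;=\; 0.756 \pm 0.135 \;\approx\; 0.76 \pm 0.14
\]
```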

For δ¹⁸O_atm values, the estimated recoverability should be no less than
70%. First, 29 samples were analysed for δ¹⁸O_atm. Second, the co-depth
δD_ice values of δ¹⁸O_atm samples span 97% of the 40k-world δD_ice range
(Extended Data Fig. 6a).


Potential sample mixing on various length scales 

Mixing by molecular diffusion. The interval between 123 and 141 m
in ALHIC1503 shows unexpectedly low variability in the elemental and
isotopic ratios of the major gases (Extended Data Fig. 7). Molecular
diffusion in the firn column has been shown to reduce annual variability of
water isotopes and gases. In polycrystalline ice, diffusive mixing has also
been invoked to explain the lack of sub-millennial δD_ice variability preserved
in the EPICA Dome C ice core, dated ~780 kyr old. When annual layers are
thin and temperatures are high, some gas records (for example, δO₂/N₂)
can also undergo extensive diffusive smoothing and are expected to lose
even the glacial-interglacial variability.

However, diffusive smoothing of climate records is unlikely to be
significant at the Allan Hills BIA. First, current temperatures within the
boreholes at the Allan Hills BIA are around −30 °C. Rates of diffusion
at the Allan Hills BIA should be substantially slower than at the warm
basal temperatures (>−10 °C) expected at the bottom of thick (>3 km)
polar ice sheets. Second, as is shown below, water isotopes (δD_ice) are
expected to be much more sensitive to diffusive smoothing than gases
(O₂ and N₂). As a result, the presence of substantial sample-to-sample
variability in δD_ice values measured at 10-cm resolution indicates that the
effects of diffusion on this and longer length scales are likely to be minor.

Below we quantitatively estimate the characteristic diffusion time
scale of water isotopes and of oxygen and nitrogen gases in the ice.
Gas and water permeation coefficients in the ice have been estimated
from molecular dynamics simulations and limited observations in
some 'natural experiments'. Values obtained from different methods
can differ by more than one order of magnitude. In our calculation, the
fastest permeation parameters are used whenever possible to provide
the most rigorous test of diffusive smoothing.

For the self-diffusion of water molecules in the ice, the diffusion time
scale (τ_ice) depends solely on the diffusivity of water molecules in ice
(D_ice) and the length of interest (L; 0.1 m):

\[ \tau_{\mathrm{ice}} = \frac{L^{2}}{D_{\mathrm{ice}}} \tag{1} \]

The diffusivity of water molecules in ice is determined by:

\[ D_{\mathrm{ice}} = D^{0}_{\mathrm{ice}} \times \exp\!\left(-\frac{Q_{\mathrm{ice}}}{RT}\right) \tag{2} \]

where D⁰_ice (9.1 × 10⁻⁴ m² s⁻¹) is a constant of diffusion for water molecules
in ice, Q_ice (5.86 × 10⁴ J mol⁻¹) the activation energy of diffusion, R (8.314
J mol⁻¹ K⁻¹) the gas constant, and T the temperature. The characteristic
time scale of self-diffusion in ice as a function of temperature is
shown in Extended Data Fig. 8. Depending on temperature, it takes 10⁵
to 10⁷ years for diffusive smoothing to smear the original signal on the
length scale of 0.1 m.
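
Equations (1) and (2) can be evaluated directly. In the sketch below the function names are hypothetical, and the exponent of the pre-exponential constant (taken here as 9.1 × 10⁻⁴ m² s⁻¹) is an assumption, because it is not legible in the extracted text; the activation energy and gas constant are as quoted.

```python
import math

R_GAS = 8.314              # J mol^-1 K^-1
D0_ICE = 9.1e-4            # m^2 s^-1; exponent is an assumption (illegible in source)
Q_ICE = 5.86e4             # J mol^-1
SECONDS_PER_YEAR = 365.25 * 24.0 * 3600.0


def diffusivity_ice(temperature_k):
    """Self-diffusivity of water molecules in ice, equation (2)."""
    return D0_ICE * math.exp(-Q_ICE / (R_GAS * temperature_k))


def tau_ice_years(temperature_k, length_m=0.1):
    """Characteristic diffusion time scale in years, equation (1)."""
    return length_m ** 2 / diffusivity_ice(temperature_k) / SECONDS_PER_YEAR
```

With these inputs the time scale at the present borehole temperature of about −30 °C (243 K) is of the order of a million years and lengthens as the ice gets colder.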

Next, we consider the molecular diffusion of O₂ and N₂ in the ice.
Below is a simple model of gas permeation with a large reservoir of gases
(that is, bubbles) and a medium through which diffusion takes place
(that is, ice): (1) First, gases in the reservoir dissolve into the ice. The
amount of dissolved gas depends on ice volume, pressure, and solubility.
(2) Next, the gas will diffuse in the ice driven by the concentration gradient.
This is assumed to be the rate-limiting step. (3) Diffusive gas exchange
in the ice along the concentration gradient will immediately be
compensated by the gas exchange between the ice phase and the bubble
phase. (4) Eventually the concentration gradient in the bubbles will be
balanced owing to diffusive flux.

Were there no gas reservoirs, the characteristic diffusion time scale
of gas species m would simply follow the diffusivity of gas m in ice (D_m)
and the length of interest (L):

\[ \tau_m = \frac{L^{2}}{D_m} \tag{3} \]

where L is 0.5 m because the spatial resolution of gas measurements
is about 50 cm.

Similar to the diffusivity of water isotopes, the diffusivity of gas m
follows:

\[ D_m = D^{0}_{m} \times \exp\!\left(-\frac{Q_m}{RT}\right) \tag{4} \]

where D⁰_m is a constant and Q_m is the activation energy of diffusion.

However, the presence of the reservoir would compensate for the
diffusive flux. Here, we introduce the concept of a partitioning function
Z, the ratio of the number of gas molecules in the bubbles to the number
of gas molecules dissolved in the ice:

\[ \tau_m = Z_m \times \frac{L^{2}}{D_m} \tag{5} \]

By definition, Z of a given gas species m is

\[ Z_m = \frac{n_{m,\mathrm{gas}}}{n_{m,\mathrm{ice}}} \tag{6} \]

where n_{m,gas} and n_{m,ice} are the number of gas molecules in the gas phase
and in the ice phase, respectively.

It is assumed that n_{m,gas} > n_{m,ice}. The number of gas m molecules in the
gas phase can be inferred from the molar fraction of gas m (f_m) and the
total number of gas molecules in the ice (n_gas), which has indeed been
measured as the gas content of the ice (V), expressed in SI units m³ kg⁻¹:

\[ n_{m,\mathrm{gas}} = n_{\mathrm{gas}} \times f_m \tag{7} \]

Given the ideal gas law, equation (7) can be re-written as:

\[ n_{m,\mathrm{gas}} = \frac{P^{\circ} V m_{\mathrm{ice}}}{R T^{\circ}} \times f_m \tag{8} \]

where m_ice is the mass of the ice, P° is 101,325 Pa and T° is 273 K.

The number of molecules of gas m in the ice phase depends on the
solubility of gas m (S_m), the partial pressure of gas m (P_m), and the number
of water molecules (n_ice):

\[ n_{m,\mathrm{ice}} = S_m \times P_m \times n_{\mathrm{ice}} \tag{9} \]

The solubility of gas m is governed by:

\[ S_m = S^{0}_{m} \times \exp\!\left(-\frac{Q_m}{RT}\right) \tag{10} \]


where S⁰_m (Pa⁻¹) is the dissolution constant, and Q_m the activation
energy of dissolution. $5, and Sy, are 3.7 x 10 Pa“ and 4.510" Pa‘, 
respectively; Q_O₂ and Q_N₂ are 9,200 J mol⁻¹ and 7,900 J mol⁻¹,
respectively.

The partial pressure of gas m is the product of the bubble
pressure, which is assumed to be the hydrostatic pressure at the
depth at which bubbles are located, and the molar fraction of gas
m in air:

\[ P_m = \rho_{\mathrm{ice}} \times g \times h \times f_m \tag{11} \]

where ρ_ice is the density of ice (920 kg m⁻³), g the gravitational
acceleration constant (9.8 m s⁻²), and h the depth. In our case, h is assumed to
be a constant 150 m.

The number of water molecules n_ice can be directly calculated from
the mass of the ice:

\[ n_{\mathrm{ice}} = \frac{m_{\mathrm{ice}}}{M_{\mathrm{water}}} \tag{12} \]

where M_water is the molecular weight of H₂O (0.018 kg mol⁻¹).
Therefore, the final expression for Z_m is:

\[ Z_m = \frac{\dfrac{P^{\circ} V m_{\mathrm{ice}}}{R T^{\circ}} \times f_m}{S_m \times \rho_{\mathrm{ice}} \times g \times h \times f_m \times \dfrac{m_{\mathrm{ice}}}{M_{\mathrm{water}}}} \tag{13} \]

\[ Z_m = \frac{P^{\circ} \times V \times M_{\mathrm{water}}}{R \times T^{\circ} \times S_m \times \rho_{\mathrm{ice}} \times g \times h} \tag{14} \]

Using equations (4) and (14) and relevant parameters, the diffusion time
scales of O₂ and N₂ are computed as a function of absolute temperature
(Extended Data Fig. 8). Except at very cold temperatures (<230 K), water
isotopes measured every 10 cm should be more susceptible to diffusive
mixing than gases measured every 50 cm. As a result, the preservation
of δD_ice variability argues that flat δO₂/N₂ and δ¹⁸O_atm profiles are not the
result of diffusive smoothing. However, if the temperature is indeed that
low, the characteristic time scale of gases becomes too large (>10⁷ yr)
for substantial smoothing to occur in million-year-old ice. We did not
calculate the permeation coefficients for CH₄ or CO₂ as these molecules
have larger molecular diameters than O₂ and N₂, and should diffuse at
an even slower rate and have a longer τ.
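
The partitioning function and the resulting gas time scale can be sketched as follows. The function names are hypothetical. The solubility and diffusivity are passed in as arguments instead of being hard-coded, because their numerical constants are not fully legible in the text; the example values in the usage lines are placeholders for illustration only, apart from the gas content of about 0.098 cm³ g⁻¹ (9.8 × 10⁻⁵ m³ kg⁻¹) reported for the deep samples.

```python
R_GAS = 8.314        # J mol^-1 K^-1
P_STD = 101325.0     # Pa
T_STD = 273.0        # K
RHO_ICE = 920.0      # kg m^-3
G_ACCEL = 9.8        # m s^-2
M_WATER = 0.018      # kg mol^-1


def partition_z(solubility_per_pa, gas_content_m3_per_kg, depth_m=150.0):
    """Ratio of gas molecules in bubbles to gas molecules dissolved in ice.

    The ice mass and the molar fraction cancel, leaving the form of
    equation (14).
    """
    bubble_pressure = RHO_ICE * G_ACCEL * depth_m
    numerator = P_STD * gas_content_m3_per_kg * M_WATER
    denominator = R_GAS * T_STD * solubility_per_pa * bubble_pressure
    return numerator / denominator


def tau_gas_seconds(diffusivity_m2_per_s, solubility_per_pa,
                    gas_content_m3_per_kg, length_m=0.5, depth_m=150.0):
    """Diffusion time scale of a gas with a bubble reservoir, equation (5)."""
    z = partition_z(solubility_per_pa, gas_content_m3_per_kg, depth_m)
    return z * length_m ** 2 / diffusivity_m2_per_s


# Placeholder inputs for illustration only (not values from the text).
z_example = partition_z(solubility_per_pa=4e-15, gas_content_m3_per_kg=9.8e-5)
tau_example = tau_gas_seconds(1e-11, 4e-15, 9.8e-5)
```

Because Z scales inversely with solubility, a less soluble gas keeps a larger share of its molecules in the bubbles and therefore takes longer to smooth by diffusion through the ice.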


Mixing within individual samples. In polar ice sheets the thickness of
annual ice layers declines with depth owing to compression and lateral
flow. Thinning will reduce geochemical variability if a property changes
over timescales shorter than that encompassed by the physical length of
a single sample. This problem is further amplified in stratigraphically
disturbed ice, as the orientation of the layering is uncertain. CO₂ and
O₂/N₂/Ar samples were cut from smaller lengths of ice than δD_ice samples
(which were 10-cm long). Therefore, assuming the layering is horizontal, CO₂
and O₂/N₂/Ar samples should average shorter time intervals than is the
case for δD_ice samples. On the other hand, CH₄ samples were cut in 10-cm
lengths. Because δD_ice samples show large variability, we consider that
gas properties other than ⁴⁰Ar_atm and δXe/Kr represent relatively short
climate intervals without much mixing. The δD_ice samples themselves
may sometimes be mixed, and this possibility remains to be examined
by the analysis of samples smaller than 10 cm.


Flow-induced mixing. Physical mixing of ice of different ages can result 
from glacial flow. Mechanical mixing of this type may create an artifi- 
cial correlation between properties that are otherwise uncorrelated. 
Granted, many climate properties will co-vary in the absence of mixing. 
Therefore, property-property correlations do not necessarily indicate 
that mixing has occurred. 

Extended Data Fig. 9a shows the Pearson correlation coefficient
(expressed as R²) between δD_ice and d within intervals of varying lengths.
In normally ordered ice cores, d is found to have a complex relationship
with δD_ice. We calculate R² as a function of the number of δD_ice
samples included. The section between 132 and 142 m shows a very strong
correlation between δD_ice and d, which raises the suspicion of
mixing.

We next examined the gas records between 132 and 140 m. The maximum
and minimum CO₂ values within this 10-m section are 253 and
217 ppm, respectively. Because the gas content in the deep ALHIC1502
and ALHIC1503 samples is very similar (~0.098 cm³ g⁻¹), a simple linear
mixing curve in the gas phase is expected. If we regard these two points as
end members and plot δD_ice against CO₂, the data points do not fall onto
a mixing line (Extended Data Fig. 9b). Similarly, it is difficult to explain
the δD_ice-CH₄ and CO₂-CH₄ plots by simple two-end-member mixing.
In other words, a simple mixing scenario with two end members cannot
explain our gas data, measured at every ~50 cm. We therefore consider
the property-property plots (Fig. 4) to be unaffected by mechanical
mixing on the length scale of 50 cm, although mixing at a much smaller
length scale is still possible.


Effect of respiration on gas properties 

Respiration of detrital organic matter has been shown to lead to the
in situ production of CO₂ in the bottom of polar ice sheets, resulting
in elevated concentrations of greenhouse gases compared to the
contemporaneous atmosphere. We observed substantially elevated
CO₂ and CH₄ concentrations in the basal sections of ALHIC1503 and
ALHIC1502, which we attribute to respiration and methanogenesis,
respectively. Two lines of evidence support this hypothesis. First, there
are extremely depleted (<−500‰) δO₂/N₂ ratios and high (>4‰) δ¹⁸O_atm
values in samples near the bottom of ALHIC1503. For comparison,
typical δO₂/N₂ values in Antarctic ice cores vary between −5 and −20‰
and δ¹⁸O_atm ranges from −0.5 to +1.5‰ on glacial-interglacial timescales.
Second, measured δ¹³C of CO₂ values in some deep samples from the
ALHIC1502 and ALHIC1503 cores (Extended Data Fig. 10) are outside
the range (−6 to −7‰) observed for the last 160 kyr. This is consistent
with contributions from respiratory CO₂ with a δ¹³C value of about −25‰.
The deepest measured δ¹³C sample comes from 3 m above the bedrock
in ALHIC1503 and has a δ¹³C value of −22.4‰. Assuming an initial δ¹³C
of CO₂ value of −6.5‰ and a δ¹³C of CO₂ value of −25‰ from respired
carbon, isotope mass balance suggests that 86% of the CO₂ in this sample
is derived from respiration.
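
The 86% figure follows from a two-end-member isotope mass balance, which can be checked with the short sketch below. The function name is hypothetical, and linear mixing in δ notation is the usual approximation assumed here; the end-member values are those quoted in the text.

```python
def respired_fraction(d13c_sample, d13c_atmosphere=-6.5, d13c_respired=-25.0):
    """Fraction of CO2 derived from respiration, by linear isotope mass balance.

    Solves d13c_sample = f * d13c_respired + (1 - f) * d13c_atmosphere for f.
    """
    return (d13c_sample - d13c_atmosphere) / (d13c_respired - d13c_atmosphere)


fraction_deepest = respired_fraction(-22.4)
```

A sample at the atmospheric end member returns a fraction of zero, and the deepest sample at −22.4‰ returns a fraction of about 0.86.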

In four out of the nine δ¹³C samples, δ¹³C of CO₂ was compatible with
the absence of respiration, assuming a cut-off δ¹³C value of −7‰. In the
other five samples, which originated within 7 m of the bottom of the two
cores, however, δ¹³C of CO₂ was lighter than −7‰ and indicates respiration.
In each core, we marked the depth of the shallowest sample with a δ¹³C
value below −7‰ as the cut-off depth and rejected all CO₂ samples below
it. Admittedly it is still possible that samples immediately above those
cut-off depths might also be affected, but the highest pre-MPT CO₂ value is
found around 131 m in ALHIC1503, a depth that is bracketed by two δ¹³C
measurements falling between −6‰ and −7‰ and is considered
respiration-free. As a result, even if certain samples above the threshold
depths are contaminated by respiration, our conclusions would not change.
On the basis of these results, along with anomalous values of CH₄, δO₂/N₂,
and δ¹⁸O_atm in the deeper sections, we excluded biogenic gas data below
185.950 m in ALHIC1502 and 139.625 m in ALHIC1503 from our evaluation.
We still included δD_ice from those sections in Fig. 2.


Data availability 


Allan Hills stable water isotope and gas data that support the findings
of this study are available on the United States Antarctic
Program Data Center (http://www.usap-dc.org/) with the following
identifiers: DOI: 10.15784/601129 (ALHIC1502 stable water isotopes); DOI:
10.15784/601128 (ALHIC1503 stable water isotopes); DOI: 10.15784/601201
(heavy noble gases); DOI: 10.15784/601202 (CO₂ concentration and
δ¹³C-CO₂); DOI: 10.15784/601203 (CH₄ concentration); and DOI:
10.15784/601204 (elemental and isotopic composition of O₂, N₂ and Ar).
Extended Data Fig. 1 | Satellite imagery of the Allan Hills study area. Right,
WorldView03 colour pan-sharpened imagery (copyright 2011, DigitalGlobe,
Inc.) of the main ice field, Allan Hills Blue Ice Area (in true colour mode). Inset,
Antarctica with hill shading (source: Australian Antarctic Data Centre, Map
13469; licensed under a Creative Commons Attribution 3.0 Unported License)
on which our study area is marked by the black square. The black outcrop in the
main image is the Allan Hills nunatak. The red arrow marks the local ice flow.
Left, a magnified image of the drilling site from the same source file. The
imagery is enhanced (gamma-adjusted on ArcGIS 10) to highlight the colour
contrast within the blue ice, which now has a greenish hue owing to the colour
rendition. The brown lineations in the ice are exposed dust bands, providing a
first-order tracer of surface ice stratigraphy. The locations of the cores
reported in this work (see text) are marked with red circles. The location of
the GPR profile in Extended Data Fig. 2 is shown as a yellow dashed line.
Extended Data Fig. 2 | GPR profile proceeding from the SW to the NE,
crossing (within less than 5 m) ALHIC1502 and ALHIC1503. The exact transect
location is shown in Extended Data Fig. 1. The location and depths of boreholes
ALHIC1502 and ALHIC1503 (red bars) and ice drilled previously as BIT58 (grey
bar) are indicated. The profile was collected with an 80-MHz MLF antenna in a
step-and-collect survey style with a step size of 25 cm and a stacking rate of
64 scans. Standard post-processing steps were applied, including time-zero
position correction, distance normalization, 50/110-MHz finite impulse
response filter, background removal, and gain adjustment. Depth estimates
from two-way travel time (TWTT) and migration use a radar travel velocity of
0.165 m ns⁻¹. a, Non-migrated radargram showing dipping bed-parallel englacial
stratigraphy and a strong dipping apparent bedrock reflection. Dashed blue line
indicates the modelled location of the actual bedrock as shown in b.
b, Migrated radargram showing the 'true' location of the bedrock. Migration
quality is poor at this location owing to the steeply dipping slope of the bedrock
rise. Comparison between a and b suggests that the correct bedrock reflector
location is translated downward and more steeply dipping than indicated in the
non-migrated data. ALHIC1502 and ALHIC1503 borehole depths independently
verify this interpretation.
Extended Data Fig. 3 | Minimal impact of gas loss on oxygen and nitrogen 
isotopes. Pair differences of δ¹⁸O_atm (Δδ¹⁸O_atm; red) and δ¹⁵N (Δδ¹⁵N; blue) are 
plotted against ΔδO₂/N₂ in ALHIC1502 and ALHIC1503 samples. The difference 
between two replicate samples is attributable to gas loss. The very weak 
dependence of Δδ¹⁸O_atm and Δδ¹⁵N on ΔδO₂/N₂ suggests that gas isotopes are 
relatively insensitive to gas loss fractionation in Allan Hills ice. 
Extended Data Fig. 4 | Age assignment of CO₂ and CH₄ samples. Using the two 
age models discussed in the text, the conservative approach (top and bottom 
left) and the proximity approach (top and bottom right), CO₂ and CH₄ are 
plotted against δD_ice. Both of the age models show reduced range in the MPT and 
pre-MPT ice. We note, however, that the highest CO₂ and CH₄ values fall into the 
unassigned category in the strictest age model. 
Extended Data Fig. 5 | Evaluating the fraction of CO₂ range captured by 37 
discrete samples. a, Ice core CO₂ record between 0 and 800 ka versus LR04 
benthic foram δ¹⁸O stack synchronized onto the AICC2012 timescale. The 
result of a linear regression is shown and used to calculate the synthetic CO₂ 
record between 1.2 and 2.0 Ma. b, A synthetic CO₂ record between 1.2 and 2.0 Ma 
reconstructed from LR04 δ¹⁸O and the regression parameters calculated in a. 
The range of CO₂ in this synthetic time series is 213-269 ppm. c-h, Fraction of the 
recovered CO₂ variability (observed range/true range) of the synthetic CO₂ 
time series (see Extended Data Fig. 5b) by 37 randomly selected samples, given 
different preferential preservation of interglacial ice. Here the term ‘interglacial’ 
is operationally defined as any sample with greater than 250 ppm CO₂. Different 
multipliers indicate the multiple occurrence of the ‘interglacial’ ice to simulate 
varying degrees of ice preservation biases. This analysis shows that when the 
presence of interglacial ice is no more than eight times greater than that of 
glacial ice (c-f), we could expect 37 random samples to be likely to capture 
66-98% (95% confidence interval) of the CO₂ range in the ice. 
Extended Data Fig. 6 | Evaluating the fraction of δ¹⁸O_atm and δXe/Kr range 
captured by discrete samples. a, b, Allan Hills δ¹⁸O_atm (a) and δXe/Kr (b) are 
plotted against the δD_ice of the same depth, colour coded according to their age 
units. Vertical dashed lines represent the range of δD_ice observed in the 
40k-world ice (−284 to −316‰). Notably, the range of δD_ice associated with nine 
δXe/Kr values from the 40k world is only about 40% of the entire δD_ice range in 
our 40k ice samples (−288 to −300‰); low δD_ice values (less than −300‰) are 
missing from the δXe/Kr samples. Thus, we do not expect the nine Xe/Kr samples 
to fully capture the range of Xe/Kr ratios in the 40k-world ice. On the other hand, 
the co-depth δD_ice values of δ¹⁸O_atm samples occupy 97% of the total range, 
implying that 29 δ¹⁸O_atm samples are likely to cover 70% of the variability 
preserved in the 40k-world ice, barring any diffusive smoothing of the δ¹⁸O_atm 
records. 
Extended Data Fig. 7 | Gas and ice properties in the interval between 123 and 
143 m in ALHIC1503. a, δ¹⁵N; b, δO₂/N₂; c, δ¹⁸O_atm; d, δD_ice. The error bars 
associated with gas properties represent the pooled standard deviation (1σ) of 
all measurements: 0.010‰ for δ¹⁵N, 3.063‰ for δO₂/N₂, and 0.026‰ for δ¹⁸O_atm. 
Note that the range of δ¹⁸O_atm is less than 0.2‰. By comparison, glacial-to-interglacial 
δ¹⁸O_atm variability in the 100-kyr climate cycles is about 1.5‰”". 
Extended Data Fig. 8 | Evaluating the effect of diffusive mixing on ice and gas 
properties. a, Partitioning function Z (ratio of the number of gas molecules in the 
gas phase to that in the ice phase) of O₂ (blue) and N₂ (red) as a function of 
temperature. b, Characteristic time scale of the self-diffusion of water molecules 
in ice (black), of O₂ permeation in ice (blue), and of N₂ permeation in ice (red), 
plotted as a function of temperature. Note that for water molecules the length 
scale of diffusion (L) is 0.1 m, whereas for N₂ and O₂, L = 0.5 m, reflective of their 
respective sampling resolutions. 
Extended Data Fig. 9 | Evaluating the possibility of flow-induced mixing in 
Allan Hills ice cores. a, A contour map shows the correlation coefficient (R²) 
between δD_ice and d in core ALHIC1503 as a function of the number of adjacent 
stable water isotope data points included in the calculations, from 5 to 85 with a 
step of 10. The sampling resolution is approximately 10 cm. The high correlation 
coefficient (>0.7) between 132 and 142 m raises the suspicion that mixing has 
taken place in this interval. However, the possibility of mixing is not supported 
by CO₂ and CH₄. b-d, Cross-plots of δD_ice–CO₂ (b), δD_ice–CH₄ (c), and CO₂–CH₄ (d) 
in intervals between 132 and 140 m in ALHIC1503. The black solid lines are mixing 
lines based on two end members at 135.30 m and at 139.76 m, where the 
minimum and maximum CO₂ values are measured, respectively. Most of the 
points do not fall onto the mixing line, implying that two-end-member mixing 
alone cannot explain the high correlation between δD_ice and d. 
Extended Data Fig. 10 | Respiration in basal ice revealed by δ¹³C of CO₂ in 
ALHIC1503 and ALHIC1502. a, ALHIC1503; b, ALHIC1502. δD_ice (top), CO₂ 
(middle), and δ¹³C of CO₂ (bottom) in Allan Hills ice cores are plotted against 
depth. δD_ice and CO₂ (both measured independently in smaller samples, 
plotted in red, and along with δ¹³C, plotted in blue) are also shown for 
comparison. Note the marked scale difference for CO₂ and δ¹³C between 
ALHIC1503 and ALHIC1502. 
Freshwater blooms of phytoplankton affect public health and ecosystem services 
globally”. Harmful effects of such blooms occur when the intensity of a bloom is too 
high, or when toxin-producing phytoplankton species are present. Freshwater blooms 
result in economic losses of more than US$4 billion annually in the United States alone, 
primarily from harm to aquatic food production, recreation and tourism, and drinking- 
water supplies®. Studies that document bloom conditions in lakes have either focused 
only on individual or regional subsets of lakes* ®, or have been limited by a lack of long-term 
observations’ *. Here we use three decades of high-resolution Landsat 5 satellite 
imagery to investigate long-term trends in intense summertime near-surface 
phytoplankton blooms for 71 large lakes globally. We find that peak summertime 
bloom intensity has increased in most (68 per cent) of the lakes studied, revealing a 
global exacerbation of bloom conditions. Lakes that have experienced a significant 
(P<0.1) decrease in bloom intensity are rare (8 per cent). The reason behind the 
increase in phytoplankton bloom intensity remains unclear, however, as temporal 
trends do not track consistently with temperature, precipitation, fertilizer-use trends 
or other previously hypothesized drivers. We do find, however, that lakes with a 
decrease in bloom intensity warmed less compared to other lakes, suggesting that lake 
warming may already be counteracting management efforts to ameliorate 
eutrophication’*”. Our findings support calls for water quality management efforts to 
better account for the interactions between climate change and local hydrological 
conditions”. 


The reported incidence of toxic phytoplankton blooms has risen consid- 
erably over the past half-century“. While it is generally understood that 
nutrient loading drives phytoplankton blooms”, the degree to which 
bloom conditions are changing globally and the factors that drive these 
changes among multiple interacting stressors” are still uncertain”. 
An understanding of global patterns, trends and drivers is necessary, 
however, for designing effective management and remediation strategies'®. 
Whereas past studies synthesizing information on the long-term 
trends in phytoplankton blooms of lakes have been limited by data avail- 
ability, recent advances in cloud-based parallel computing have made it 
possible to leverage high-resolution freely accessible satellite imagery 
over large areas, enabling the study of long-term environmental trends 
on a global scale’””®. 

Here, we take advantage of these advances to generate a long-term 
record of intense, near-surface phytoplankton blooms for dozens of 
large lakes across the globe. We use data from the Landsat 5 satellite to 
generate time series of peak summer bloom intensity from 1984 to 2012 
for 71 lakes in 33 countries across 6 continents (Fig. 1). In total, the data 
span 30,922 scenes and 72.6 billion lake pixels. The study lakes span a 
broad range of physical characteristics and degree of anthropogenic 
impacts (Supplementary Table 1; see Methods for a full description of 
the implemented approach). Seasonal peak bloom intensity for a given 
lake and year is defined based on the maximum observed lake-wide near-infrared 
signal magnitude, with a first-order correction of atmospheric 
interference using the shortwave-infrared signal”. Remotely sensed 
observations within the near-infrared part of the electromagnetic spec- 
trum are sensitive to intense, near-surface algal blooms (see Methods). 
An initial superset of 154 lakes was selected based on their inclusion in 
previous studies that leveraged remote sensing by satellites”””’, thus 
reducing the likelihood that persistent cloudiness obscured the images. 
These lakes all have surface areas of more than 100 km²; globally, lakes 
within this size range contain approximately 95% of all water stored in 
lakes”. Data from lakes for which little signal was observed throughout 
the study period, as well as data from lakes for which the signal was far 
outside the range over which the original algorithm was designed”, 
were removed. A smaller number of additional lakes were removed 
due to previously documented evidence of a lack of phytoplankton 
blooms. Of the final selected lakes, 38 have a documented presence of 
harmful cyanobacterial species, while the rest show evidence of other 
phytoplankton species (10 lakes) or no reported evidence of blooms 
(23 lakes). Given the heterogeneity in lake characteristics, the time series 
of the interannual bloom intensity for each lake is normalized by its own 
long-term mean and s.d. to assess the relative change in bloom inten- 
sity over time. This approach eliminates the need to compare absolute 
Fig. 1 | Global distribution of lake bloom intensity trends shows that the 
peak summertime bloom intensity has increased since the 1980s. The map 
shows bloom intensity trends for all 71 study lakes for the period 1984-2012 
(Supplementary Table 1). Colours and symbols indicate whether the bloom 


magnitudes across lakes, which has been an important barrier to past 
syntheses across lakes”. 
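
The per-lake normalization described above can be sketched as follows. This is a minimal illustration rather than the code used in the study: the function name, the use of NaN for missing years and the choice of the sample standard deviation (`ddof=1`) are assumptions, and the example values are hypothetical.

```python
import numpy as np

def normalize_bloom_series(intensity):
    """Convert one lake's annual peak bloom intensities to z-scores.

    Each lake is scaled by its own long-term mean and s.d., so only the
    relative change over time is compared across lakes.
    Years without data may be passed as NaN and remain NaN.
    """
    values = np.asarray(intensity, dtype=float)
    mean = np.nanmean(values)
    # Assumption: sample standard deviation; the text only says "s.d."
    sd = np.nanstd(values, ddof=1)
    return (values - mean) / sd

# Hypothetical annual peak intensities for a single lake (one missing year)
example_series = [0.010, 0.014, np.nan, 0.012, 0.020]
z_scores = normalize_bloom_series(example_series)
```

Because every lake is scaled by its own statistics, a z-score of +1 means "one s.d. above that lake's long-term mean" regardless of the lake's absolute signal level.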

We find that the implemented algorithm is able to successfully 
capture previously documented spatial gradients in the severity of 
phytoplankton blooms within individual lakes and temporal trends in 
phytoplankton bloom intensity for specific lakes (Extended Data Fig. 1 
and Methods). Using simulations of atmospheric radiative transfer, 
we also find that the algorithm is insensitive to reported variations 
in Landsat 5 orbit or image radiometric quality, primarily owing to 
the strong signal that arises from the intense, near-surface blooms 
identified in study lakes (see Supplementary Information). These 
results suggest that a single algorithm can indeed identify intense phy- 
toplankton blooms despite the large differences in optical properties 
across lakes”®, as long as the focus is on interannual rather than inter- 
lake variability. This lends support to the approach implemented in 
this study for tracking long-term trends globally. We then used all 71 
study lakes to assess global trends in summertime peak phytoplank- 
ton bloom intensity. We also used a subset of 49 lakes with at least 
14 years of data to explore more detailed historical temporal patterns 
in phytoplankton bloom conditions, for which the 14-year threshold 
was selected on the basis of previously published studies on global 
lake temperatures”. 

We find that peak summertime phytoplankton bloom intensity has 
increased in more than two-thirds of study lakes since the 1980s (48 out 
of 71 lakes) (Fig. 1). Increases in bloom intensity are statistically significant 
for close to a third of all lakes (P < 0.1 for 22 out of 71 lakes), whereas 
only 6 lakes exhibited a statistically significant decrease in intensity 
(P < 0.1). A similar proportion of lakes has an increasing bloom intensity 
among those with a documented presence of cyanobacteria (24 out of 
38 lakes) compared to lakes without cyanobacteria (24 out of 33 lakes), 
and the proportion of lakes with increases in bloom intensity is also con- 
sistent across lakes with different areas, volumes, mean and maximum 
depths, and latitudes (see Supplementary Table 1 and Supplementary 


intensity decreased (blue) or increased (red), and whether the trend is 
statistically significant (triangles for P< 0.1; circles for P> 0.1). The base map was 
generated using Generic Mapping Tools. 


Information). These results suggest that the observed trends are wide- 
spread globally and across lake types, in contrast to previous hypotheses 
of differential impacts as a function of latitude’** or morphometry”. This 
finding provides a global perspective that is consistent with surveys of 
sedimentary records across temperate-subarctic lakes° that show sharp 
increases in the concentrations of cyanobacterial pigments after 1985. 
This finding also corroborates putative trends of increasing harmful 
cyanobacterial blooms globally”, and counters the hypothesis that 
increased reporting of toxic blooms is instead a by-product of increased 
scientific attention®”. 

We find that lake phytoplankton bloom histories follow one of four 
prototypical pathways, termed here ‘sustained improvement’, ‘improvement 
then deterioration’, ‘deterioration’ and ‘no significant trend’ 
(Fig. 2a–d and Methods). The two pathways that include deteriorating 
conditions reveal that increases in peak bloom intensity occurred pre- 
dominantly in the latter half of the study period (Fig. 2b, c). For example, 
three-quarters of study lakes (51 out of 68) with sufficient data for the 
second half of the study period (1998-2012) exhibited an increase in 
bloom intensity during this period, whereas only a third (22 out of 66) 
experienced an increase during the first half (1984-1997). The reason 
behind the temporal coherence of changes in phytoplankton bloom 
intensity remains unclear, as temporal trends do not track consistently 
with temperature, precipitation, fertilizer-use trends, satellite data 
availability or geomorphological characteristics of individual lakes 
(Extended Data Figs. 2-5 and Supplementary Information), nor are 
there widespread trends in the seasonal timing of peak bloom intensity 
(see Supplementary Information). 

We find that although lakes that exhibited sustained improvement 
were rare (n=6), they experienced less warming (or more cooling) rela- 
tive to those that exhibited improvement then deterioration (P=0.09; 
Fig. 3 and Extended Data Fig. 6), suggesting that lake warming may have 
counteracted management efforts in the latter group. This finding sug- 
gests that nutrient reduction targets based on historical relationships 
Fig. 2 | Lake bloom histories follow one of four prototypical pathways. 

a-d, Time series for lakes with at least 14 years of data (n=49) categorized by 
historical pathway. Grey lines show 5-year moving averages of normalized bloom 
intensity, with coloured lines showing pathway averages across lakes. The time 


between bloom severity and nutrient loading may have to be revised 
in the context of climate change, as has been hypothesized”. General- 
izing the impact of warming across a wide range of lakes is inadvisable, 
however, as trends across the full lake ensemble showed little direct 
Fig. 3 | Lakes that experienced improvements in bloom conditions tend to 
have experienced little to no warming. Box plots of the water temperature 
trend (1985-2012) binned by lake historical pathway. Each box extends from the 
first to the third quartile values, with a line at the median. The whiskers extend to 
1.5× the interquartile range from the edges of the box. The plus symbols show 
outlier values past the end of the whiskers. 


series of the bloom intensity z-score for each lake is calculated using its own 
historical mean and s.d. e, Global distribution of lake pathways. The base map 
was generated using Generic Mapping Tools. 


correlation with temperature (Fig. 4, Extended Data Figs. 2,3 and Supple- 
mentary Information). Rather, these findings suggest that the effects of 
global lake warming differ depending on lake-specific characteristics”, 
and highlight the importance of assessing the role of lake attributes in 
modulating the impact of temperature on nutrient-phytoplankton 
relationships”. 

Overall, this study provides a global view of trends in intense lacus- 
trine near-surface phytoplankton blooms over the past three decades. 
We examine bloom histories for lakes with widely differing character- 
istics and geographical locations, and demonstrate the promise of 
long-term satellite observations for tracking intense bloom conditions 
across a heterogeneous set of systems to augment geographically and 
temporally limited in situ monitoring efforts. Our results corroborate 
the putative reported increase in bloom occurrence and intensity globally, 
and highlight that lakes that have exhibited a long-term decrease in 
bloom intensity are rare. Results further show that sustained decreases 
in bloom intensity are more likely to have occurred in lakes with little 
or no warming, suggesting that rising lake temperatures may hamper 
environmental recovery, and illustrating the importance of identify- 
ing factors that make some lakes more susceptible to the effects of 
warming. 
Fig. 4 | Lake bloom histories show no consistent correspondence with 
temperature, precipitation and fertilizer use. a–h, Five-year moving averages 
of normalized near-surface bloom intensity, summer lake temperatures, and 
total precipitation and fertilizer application rate over the watershed for eight 
prototypical lakes. a-d, Lakes follow the sustained improvement pathway. 

e-h, Lakes follow the deterioration pathway. Thicker temperature, precipitation 
and fertilizer lines indicate that the Pearson correlation coefficient with bloom 
intensity is significant (P< 0.1). Dashed lines indicate anti-correlations. 
METHODS 


Satellite source data and implementation of the bloom detection 
algorithm 

We used all Landsat 5 Thematic Mapper (L5 TM) images over study lakes 
(1984-2012) that covered the five months encompassing summer (June 
to October or December to April, depending on lake latitude, similar 
to a global study of satellite-estimated lake temperatures”). Given its 
long-term archive, L5 TM can be used to assess bloom severity in the 
upper layer of the water column”**. Our analysis used images from 
the L5 TM top-of-atmosphere (TOA) reflectance image collection in 
Google Earth Engine®, collected originally from the US Geological 
Survey”®. The images in this collection represent unitless planetary 
TOA reflectance ($\rho_\lambda$)”: 

$$\rho_\lambda = \frac{\pi L_\lambda d^{2}}{\mathrm{ESUN}_\lambda \cos(\theta_s)} \qquad (1)$$

where $L_\lambda$ is the spectral radiance at the sensor’s aperture 
(W (m² sr µm)⁻¹), $d$ is the Earth-Sun distance (astronomical units), $\mathrm{ESUN}_\lambda$ 
is the mean exoatmospheric solar irradiance (W (m² µm)⁻¹) and $\theta_s$ is 
the solar zenith angle (degrees). L5 TM Surface Reflectance products*® 
were not available worldwide for the full period at the time of algorithm 
development”. 
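
As an illustration of equation (1), the conversion from at-sensor radiance to TOA reflectance can be written as a short function. This is a sketch only; the images used here were already provided as TOA reflectance, and the function name, argument names and example values are hypothetical.

```python
import math

def toa_reflectance(radiance, earth_sun_distance_au, esun, solar_zenith_deg):
    """Unitless planetary TOA reflectance following equation (1).

    radiance              -- spectral radiance at the sensor, W (m^2 sr um)^-1
    earth_sun_distance_au -- Earth-Sun distance in astronomical units
    esun                  -- mean exoatmospheric solar irradiance, W (m^2 um)^-1
    solar_zenith_deg      -- solar zenith angle in degrees
    """
    cos_zenith = math.cos(math.radians(solar_zenith_deg))
    return math.pi * radiance * earth_sun_distance_au ** 2 / (esun * cos_zenith)

# Hypothetical inputs chosen so that the arithmetic is easy to follow:
# pi * 50 * 1^2 / (100 * pi * cos(60 deg)) = 50 / (100 * 0.5) = 1.0
example_reflectance = toa_reflectance(50.0, 1.0, 100.0 * math.pi, 60.0)
```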

We implemented a compositing technique to create images covering 
the whole surface area of each lake. On the basis of the 16-day revisit 
period of Landsat, we created composites for each lake by stitching 
together all L5 TM scenes overlapping with the lake during 16-day time 
intervals. For each year, intervals started on the first day of the first 
month analysed (either 1 June or 1 December) and ran until 144 days 
later, or 9 intervals total. Overlapping pixels from multiple scenes 
were averaged in each final composite. Artefacts of this process were 
observed for some lakes (for example, horizontal or vertical stripes 
at scene boundaries), but were not found to substantially affect sub- 
sequent analyses. 
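
The compositing windows described above can be enumerated with a few lines of code. This sketch assumes that each interval covers 16 consecutive calendar days with an inclusive end date; the function and parameter names are hypothetical.

```python
from datetime import date, timedelta

def composite_intervals(year, southern_summer=False, n_intervals=9, length_days=16):
    """List (first_day, last_day) pairs for the 16-day compositing windows.

    Windows start on 1 June (or 1 December when southern_summer is True)
    and together span 9 x 16 = 144 days.
    """
    start = date(year, 12 if southern_summer else 6, 1)
    intervals = []
    for i in range(n_intervals):
        first_day = start + timedelta(days=i * length_days)
        # Assumption: the last day of each window is inclusive
        last_day = first_day + timedelta(days=length_days - 1)
        intervals.append((first_day, last_day))
    return intervals

# Windows for a northern-summer lake in a hypothetical year
windows_1990 = composite_intervals(1990)
```

All scenes acquired within one window would then be stitched into a single composite, with overlapping pixels averaged.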

To mask land within each 16-day composite image, we modified lake 
polygons from the Global Lakes and Wetlands Database”. The original 
polygons were adjusted manually to fully cover the lake surface area 
based on images with maximum lake extent over the study period. Within 
these polygons, we used the Fmask algorithm*,, as implemented in 
Google Earth Engine”, to exclude cloud and cloud shadow pixels, and 
to identify the water surface area in each composite. We tested the use of 
other cloud detection algorithms (for example, the Landsat Automatic 
Cloud Cover Assessment procedure’’), but Fmask resulted in fewer 
misclassification errors due to turbid water and haze. We also tested 
static land cover maps to mask land pixels, but the Fmask water layer 
better accounted for dynamic changes in lake shorelines over time. 

We then applied a bloom detection algorithm based on the near- 
infrared band, using the shortwave-infrared (SWIR) band to minimize 
effects of atmospheric interference and a ‘greenness filter’ to distinguish 
suspended sediment”. The algorithm subtracts the pixel value in the 
SWIR band (L5 TM band 5, 1.55-1.75 µm), weighted with an empirical 
parameter, from the value in the near-infrared band (L5 TM band 4, 
0.76-0.90 µm) as a measure of near-surface bloom intensity: 

$$B = F_g\,(\rho_{b4} - 1.03\,\rho_{b5}) \qquad (2)$$

where $B$ is the bloom intensity, ranging from 0 to 0.1 where 0.1 generally 
represents intense near-surface phytoplankton blooms, $\rho_{b4}$ is L5 TM 
TOA band 4, $\rho_{b5}$ is L5 TM TOA band 5 and $F_g$ represents the greenness 
filter that masks out pixels below a certain hue ($H$) threshold based on 
L5 TM TOA in bands 1, 2, and 3: 

$$F_g = \begin{cases} 1 & \text{if } H < 1.6 \\ 0 & \text{if } H > 1.6 \end{cases} \qquad (3)$$
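
A per-pixel version of the bloom index in equation (2) might look like the sketch below. It is illustrative only: the SWIR weight of 1.03 is our reading of equation (2), the greenness filter is passed in as a precomputed 0/1 array rather than derived from hue, and all names and pixel values are hypothetical.

```python
import numpy as np

# Assumption: empirical SWIR weight as read from equation (2)
SWIR_WEIGHT = 1.03

def bloom_intensity(nir, swir, greenness_filter):
    """Near-surface bloom intensity B for arrays of TOA reflectance.

    nir              -- L5 TM band 4 TOA reflectance
    swir             -- L5 TM band 5 TOA reflectance
    greenness_filter -- array of 0/1 values (F_g); 0 masks a pixel out
    """
    nir = np.asarray(nir, dtype=float)
    swir = np.asarray(swir, dtype=float)
    f_g = np.asarray(greenness_filter, dtype=float)
    return f_g * (nir - SWIR_WEIGHT * swir)

# Two hypothetical pixels: the second is masked by the greenness filter
example_b = bloom_intensity([0.10, 0.08], [0.02, 0.01], [1, 0])
```

Passing the filter as an input keeps the sketch independent of how hue is computed from bands 1 to 3, which is not specified in enough detail here to reproduce.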
Under intense, near-surface algal bloom conditions, backscattering 
due to phytoplankton abundance dominates the water-leaving radiance 
in the near-infrared range that is otherwise dampened by pure water 
absorption*** (Extended Data Figs. 7, 8; see Supplementary Informa- 
tion for additional discussion of algorithm sensitivity to in situ bloom 
conditions). Although this approach has proven effective in identifying 
the extent of near-surface intense phytoplankton blooms, we emphasize 
that the retrieval of concentrations of specific bloom severity metrics 
(for example, chlorophyll-a) is beyond the scope of this study. 


Selection of the initial 76 study lakes 

We first selected 154 lakes from those included in a study of global lake 
temperatures” with temperature data collected by satellite. The ration- 
ale for choosing these lakes was twofold: (1) there was a lower likelihood 
of persistent cloudiness obscuring images because these lakes had 
previously been successfully explored using satellite remote sensing 
and (2) the in situ temperature data for other lakes were not collected 
in a consistent way that would be representative of the whole lake (for 
example, some data points were collected at point locations on specific 
shores of the lake). 

After an initial exploration of three randomly selected years of com- 
posite images for each lake, we sub-selected 95 lakes for analysis based 
on the criteria that lake pixels have non-zero bloom intensity values 
below a threshold of 0.1 in a substantial portion of images. We did 
this to identify lakes that had ranges of bloom intensity values most 
similar to Lake Erie, for which the algorithm was originally validated”. 
Compared to the 59 lakes that were not selected, these 95 lakes in 
general were shallower and had smaller lake volume, suggesting that 
the approach may have been more applicable for detecting blooms 
in shallower lakes. 

We then performed a literature search to explore whether the 
observed bloom intensity signal in lakes was likely to be indicative of 
real phytoplankton blooms (of any type) or a false-positive result. We 
used ISI Web of Science, Google Scholar and Google Search for the lake 
name, lake name + “algal bloom” and lake name + “eutrophic”. Based on 
the results of this search, we determined that 78 lakes had either some 
evidence of phytoplankton blooms (51 out of 78) or no evidence against 
phytoplankton blooms (27 out of 78), whereas the remaining 17 lakes had 
strong evidence against the signal representing phytoplankton blooms 
(for example, in one lake a high bloom intensity signal was erroneously 
caused by high ice reflectance). Two lakes were also removed from the 
dataset at this stage based on a lack of L5 TM data during the study period 
(that is, data were available for only one or two years). 

The remaining 76 lakes were selected for further analysis. In these 
lakes, 54% had support from the literature for evidence of presence of 
cyanobacteria, with 29% of lakes specifically dominated by Microcystis 
sp. In total, 10,892 composites were compiled from 30,922 L5 TM 
scenes during the selected months across study lakes, with a median 
of 139 composites and 283 scenes per lake. The total number of com- 
posites per lake ranged from 24 for Lake Edward (43 scenes total) to 
250 for Lake Winnipeg (1,771 scenes total). Lakes with a greater num- 
ber of scenes tended to be in North America, as expected based on 
historical L5 TM coverage*’. The number of years with available data 
per lake ranged from 5 to 28 years with a mean of 21 years (Supple- 
mentary Table 1); among lakes with at least 14 years of data, the mean 
was 22 years per lake. 
Validation of well known spatial gradients in bloom intensity 

We further evaluated the proposed approach by comparing geographi- 
cal regions within lakes with known spatial gradients of bloom intensity. 
We searched the literature for descriptions of spatial gradients for the 76 
study lakes, for example, on the basis of chlorophyll-a or phytoplankton 
biomass observations, and then examined algorithm output values in 
regions that would be expected to show the largest differences. Using 
this approach, we identified 48 pairs of regions across 22 lakes (Sup- 
plementary Table 2). For instance, in Lake Balaton we identified three 
regions based on documented chlorophyll-a and biomass gradients from 
the southwest to the northeast” ”. The southwest basins are eutrophic 
to hypertrophic, the northeast basin is mesotrophic and the middle 
basin is in-between’; the expectation is therefore that bloom intensities 
would be higher in a region in the southwest basin relative to a region in 
the middle basin, which would then also be higher relative to a region 
in the northeast basin. 

We documented the expected bloom intensity in each region qual- 
itatively (that is, high, medium or low) as well as the strength of the 
evidence supporting the expected direction of the gradient between 
regions (that is, strong, medium or weak) (for example, some are based 
on extensive in situ sampling over many years, whereas others are based 
on more qualitative inferences). For each of the 48 pairs of regions, we 
computed the difference in intensity between regions by comparing 
the mean intensity across all pixels within each region of the lake over 
the full study period. 

The gradient between regions that was inferred using the imple- 
mented algorithm was in the correct direction for over three-quarters 
of region pairs (37 out of 48 pairs). For the Lake Balaton example 
described earlier, all three comparisons between mean pixel values in the 
three regions were consistent with the expected sign of the difference 

in bloom intensity (high versus medium, high versus low, medium 
versus low). 

The previously published evidence was not strong for 10 out of the 
11 pairs of regions for which the gradient was in the direction opposite to 
the expected direction. For example, two pairs were for Songkhla Lake, 
where the evidence on the spatial gradient came from one publication 
that was not peer-reviewed and another that was based purely on computational 
modelling rather than in situ observations”. Because the vast 
majority of the discrepancies in identifying spatial gradients were for 
region pairs with weaker support in the literature, results indicate that 
the implemented algorithm is able to identify well established spatial 
gradients in bloom intensity across a variety of lakes. 

We note that the qualitative approach used here represents a first 
step towards understanding how global patterns in bloom intensity may 
be measured using satellite remote sensing. Although the algorithm 
used here was validated quantitatively for data from only Lake Erie, the 
results of our qualitative approach together with atmospheric radiative 
transfer simulations (see Supplementary Information), indicate that 
the algorithm provides a useful signal of bloom intensity for the lakes 
analysed in this study. Additional validation may be needed for other 
applications. 


Generation of bloom intensity time series and trends 

To generate long-term time series of summertime maximum bloom 
intensities for each lake, we summed the algorithm output values over 
the whole lake for each composite, as a measure of bloom intensity. This 
approach assumes that the algorithm output value is correlated with 
measures such as near-surface chlorophyll-a or phytoplankton biomass, 
which is supported by validation based on well known spatial gradients, 
as described above. We took the largest composite bloom intensity each 
year as an estimate of summertime maximum bloom intensity, similar to 
other studies””. This approach focuses on the relative spatial intensity 
of phytoplankton blooms over time, and therefore minimizes the impact 
of noise in L5 TM images over bodies of water®. 
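
The aggregation described above can be sketched as follows. This is a minimal illustration, assuming each composite is available as a (year, per-pixel algorithm output values) pair; the function name and data layout are hypothetical and are not taken from the study's code.

```python
from collections import defaultdict


def annual_peak_intensity(composites):
    """Return {year: largest whole-lake summed intensity for that year}.

    `composites` is an iterable of (year, pixel_values) pairs, where
    pixel_values holds the algorithm output for each lake pixel
    (hypothetical layout, assumed for illustration).
    """
    totals_by_year = defaultdict(list)
    for year, pixel_values in composites:
        # Sum the algorithm output over the whole lake for this composite.
        totals_by_year[year].append(sum(pixel_values))
    # Keep the largest composite total per year as the summertime maximum.
    return {year: max(totals) for year, totals in sorted(totals_by_year.items())}
```
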


We removed estimates from each time series when the observed 
lake surface area was less than 80% of maximum, or less than the mean 
minus one s.d. of the whole surface area of the lake for the time series, 
whichever was lower. These guidelines were determined heuristically, 
based on removing composites that visually had large portions of 
the lake surface area unavailable due either to missing L5 TM scenes 
or high cloud cover. We also removed estimates for years in which 
there were fewer than 3 composites due to missing data, as these were 
expected to be less representative of the summertime maximum. Main 
findings were not found to be sensitive to minor variations in these 
thresholds. 
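
One possible reading of these screening rules is sketched below. It assumes that the area threshold is the lower of the two candidate values, that the s.d. is the sample s.d. of observed areas across the lake's time series, and that the three-composite rule is applied after the area screen; none of these details is stated explicitly in the text, and all names are hypothetical.

```python
from statistics import mean, stdev


def filter_composites(records, min_per_year=3):
    """Screen composites for one lake.

    `records` is a list of (year, observed_area, intensity) tuples
    (hypothetical layout). Returns the records that pass both rules.
    """
    areas = [area for _, area, _ in records]
    # Lower of: 80% of the maximum area, or mean minus one s.d. (assumed sample s.d.).
    threshold = min(0.8 * max(areas), mean(areas) - stdev(areas))
    kept = [r for r in records if r[1] >= threshold]

    # Count the composites that remain in each year (assumed order of operations).
    counts = {}
    for year, _, _ in kept:
        counts[year] = counts.get(year, 0) + 1
    # Drop years left with fewer than `min_per_year` composites.
    return [r for r in kept if counts[r[0]] >= min_per_year]
```
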

We further adjusted for variations in observed lake surface due to 
clouds or missing L5 TM scenes by dividing the annual bloom intensity 
estimates by the observed surface area of each lake in each year. This had 
the benefit of correctly adjusting observed bloom severity trends for 
lakes where the water surface area changed substantially over time. For 
example, for a subset of lakes that have dried up during the study period 
(for example, Lake Urmia or the Aral Sea), a decline in water surface area 
would otherwise be incorrectly observed as a decline in bloom intensity. 
Normalizing by water surface area more accurately reflects the true 
bloom conditions in those lakes over time. Although, in principle, this 
could also make blooms of constant severity in an otherwise shrinking 
lake appear to have an increasing trend, this scenario was not found 
among study lakes with declining water surface area. Given the highly 
varying local conditions for study lakes with respect to bloom intensity 
and water surface area trends with time, normalizing by water surface 
area provided the best approach overall for accounting for variations 
due to clouds and missing L5 TM scenes. 

Finally, to compare data across different lakes, we normalized the 
annual time series of the peak bloom intensity for each lake by its own 
long-term mean and s.d. values, creating bloom intensity z-scores. This 
is similar to other studies that have treated historical bloom data from 
remote sensing™, tracked long-term trends in cyanobacteria’ or esti- 
mated long-term trends of other parameters in lakes””. 
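
The two normalization steps (division by observed surface area, then conversion to z-scores) can be illustrated with a short sketch. It assumes annual values are held in dictionaries keyed by year and that the s.d. is the sample s.d.; the names are hypothetical.

```python
from statistics import mean, stdev


def bloom_zscores(peak_intensity, observed_area):
    """Return {year: z-score} for a single lake.

    Both arguments map year -> value (hypothetical layout).
    """
    years = sorted(peak_intensity)
    # Divide each annual peak by the lake area observed in that year.
    per_area = [peak_intensity[y] / observed_area[y] for y in years]
    # Normalize by the lake's own long-term mean and s.d. (assumed sample s.d.).
    mu, sigma = mean(per_area), stdev(per_area)
    return {y: (v - mu) / sigma for y, v in zip(years, per_area)}
```

Because each lake is scaled by its own mean and s.d., the resulting values are unitless and can be compared across lakes of very different size.
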


Evaluation of bloom intensity time series and trends 

To further evaluate the implemented approach, we compared the tem- 
poral evolution of peak bloom intensity in well studied lakes to those 
described in existing literature, and also compared bloom intensity 
trends overall to trends in the SWIR TOA reflectance. 

From each time series of normalized annual peak bloom intensity, we 
tested for the presence of monotonic time trends using the S statistic 
from the Mann-Kendall trend test® and estimated the magnitude of 
temporal trends using the Theil-Sen slope. These tests are both 
non-parametric procedures that are known to be more robust to outliers 
and more accurate for skewed or heteroskedastic data” and have been 
used widely for assessing temporal trends in limnologic studies and 
those evaluating trends in phytoplankton blooms specifically°?"*. 
Trend analyses over the whole study period were performed for all 
lakes (n = 76). 
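
For readers unfamiliar with these statistics, the sketch below computes the Mann-Kendall S statistic and the Theil-Sen slope from their textbook definitions. It is an illustration only: it does not compute the P value of the test, and it is not the implementation used in the study.

```python
from itertools import combinations
from statistics import median


def mann_kendall_s(values):
    """S statistic: increasing pairs minus decreasing pairs, in time order."""
    s = 0
    for earlier, later in combinations(values, 2):
        # +1 if the later value is larger, -1 if smaller, 0 if tied.
        s += (later > earlier) - (later < earlier)
    return s


def theil_sen_slope(years, values):
    """Median of the slopes between all pairs of observations."""
    slopes = [
        (values[j] - values[i]) / (years[j] - years[i])
        for i, j in combinations(range(len(years)), 2)
        if years[j] != years[i]
    ]
    return median(slopes)
```

A negative S together with a negative slope indicates a decreasing series; because both quantities depend only on pairwise comparisons, a single extreme year has limited influence.
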

For lakes in which large changes in bloom intensity have been docu- 
mented for the study period, the data used here accurately matched 
both the direction and timing of changes described in the literature. 
For example, substantial improvements in bloom conditions have been 
reported for Lake Balaton®, Clear Lake® and Lake Simcoe’, and 
all showed statistically significant (P < 0.1) decreases in peak bloom 
intensity in the data presented here over the same timeframes as 
described in previous studies. The improvement in water quality of 
Lake Balaton that occurred in the 1990s, which coincided with sewage 
controls and a decline in agriculture”, was reproduced correctly in the 
time series (Fig. 4a). Clear Lake experienced a similar decline in bloom 
intensity in the 1990s, probably due to the trophic cascade stemming 
from drought®, that was also correctly reproduced (Fig. 4b). Furthermore, 
for Lake Simcoe, an improvement in water quality that occurred 
soon after 1995, which coincided with a widespread invasion of zebra 
mussels’****°, was also successfully reproduced (Fig. 4d). The developed 
data similarly reproduced histories for lakes with documented increases 
in bloom intensity during the study period, such as Lake Winnipeg” and 
Lake Baikal**’, and captured documented temporal patterns of decreas- 
ing and increasing bloom intensities in ecosystems as diverse as Lake 
Erie® and Tsimlyansk Reservoir® (Extended Data Fig. 9). 

We further evaluated the performance of the approach for the 
detection of false-positive (that is, high derived bloom intensity for 
low bloom intensity spectra) and false-negative (that is, low derived 
bloom intensity for high bloom intensity spectra) results using the 
atmospheric radiative transfer simulations described in the Supple- 
mentary Information. The analyses demonstrated the robustness of the 
algorithm in limiting false-positive results—that is, correctly identify- 
ing instances in which no blooms were present. However, the analyses 
also identified that aerosol optical thickness (AOT) has an effect on the 
incidence of false-negative results, whereby a higher AOT (for example, 
hazy conditions) resulted in an increased likelihood that high bloom 
intensity events were missed. 

Because SWIR can be used as a proxy for AOT, we next evaluated 
whether derived bloom intensity trends in study lakes could errone- 
ously be due to trends in aerosol conditions that affect the likelihood of 
false-negative results. This was accomplished by comparing peak bloom 
intensity trends to trends in the SWIR TOA reflectance. For five lakes with 
statistically significant trends (P < 0.1) in both peak bloom intensity and 
SWIR TOA reflectance, we found that observed bloom intensity trends 
coincided with trends in SWIR TOA reflectance, increasing the risk that 
apparent trends in bloom intensity could be due to a change in the 
likelihood of false-negative results. For these lakes, peak bloom intensity 
and SWIR trends were in opposite directions (that is, increasing SWIR 
resulting in an increased likelihood of false-negative results consistent 
with a decreased bloom intensity trend, and vice versa for decreasing 
SWIR). Decreasing bloom intensity trends were observed in four of the 
five lakes (Chao, Gaoyou, Taihu and Sarykamshskoye), with an increas- 
ing bloom intensity trend observed in the fifth (Kremenshugskoye). 
Increasing SWIR trends in the four lakes were consistent with trends 
in aerosols in Eastern China and central Asia over the study period”, 
whereas a reduction in aerosols in Eastern Europe” was consistent with 
the latter. Because we could not confirm whether or not the observed 
bloom intensity trends were in fact attributable to trends in SWIR, to be 
conservative we removed these five lakes from the subsequent analysis. 
This resulted in a final set of n = 71 lakes for further analysis. For trends 
estimated over shorter periods (1984-1997 and 1998-2012), slightly 
fewer lakes were used (n = 66 and n = 68, respectively) because at least 
two years of observations per period were required to compute trends. 
A subset of lakes with at least 14 years of data (n = 49) was also used to 
explore temporal patterns in peak bloom intensity. 
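
The screening logic for possible aerosol confounding can be sketched as below. The sketch flags lakes whose bloom and SWIR trends are both significant and point in opposite directions, which is the pattern described for the five removed lakes. Whether a lake with two significant trends in the same direction would also have been removed is not stated, so the direction check is an assumption, as are the dictionary keys.

```python
def flag_aerosol_confounded(lakes, alpha=0.1):
    """Return names of lakes whose bloom trend may reflect an aerosol trend.

    `lakes` maps a lake name to a dict with the hypothetical keys
    'bloom_slope', 'bloom_p', 'swir_slope' and 'swir_p'.
    """
    flagged = []
    for name, trend in lakes.items():
        both_significant = trend["bloom_p"] < alpha and trend["swir_p"] < alpha
        # Rising SWIR (more haze) with falling bloom intensity, or the reverse.
        opposite = trend["bloom_slope"] * trend["swir_slope"] < 0
        if both_significant and opposite:
            flagged.append(name)
    return flagged
```
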

Beyond this analysis of SWIR trends, we found no other evidence that 
any potential misclassification of bloom intensity trends occurred. 
To assess whether our approach could have been incorrectly measur- 
ing trends in other environmental variables, we explored historical 
patterns of potential confounders that have been documented in the 
literature. Trends in Secchi depth did not match the observed bloom 
intensity trends (for example, in Lake Simcoe”, Great Salt Lake” or Lake 
Okeechobee”), indicating a lack of evidence to suggest that the imple- 
mented algorithm was potentially measuring other constituents of 
water quality. Nor did we find any evidence that global changes in atmos- 
pheric constituents, such as aerosols, dust and water vapour, could 
explain the overall geographical pattern of bloom observations (except 
in the five aforementioned lakes), because such constituents have a 
spatial coherence at large regional scales”””*”° whereas the observed 
lake trends were highly spatially heterogeneous (Fig. 1). For individual 
lakes, bloom intensity trends also did not track well with observed trends 
in submerged aquatic vegetation (for example, Lake Okeechobee”), 
gypsum (for example, Salton Sea”) or cloud cover (for example, Lake 
Nicaragua’’). Taken together, this suggested that our findings about 
the global proportion of lakes with increasing bloom intensity trends 
were likely to be robust. 


Characteristics of the final 71 study lakes 

The 71 study lakes (Supplementary Table 1) spanned a wide range of 
surface areas (158 to 67,052 km²) and maximum depth (2 to 1,637 m) 
comparable to previous global studies of lakes”’. Of the 49 lakes with 
at least 14 years of bloom data, a large majority warmed over the study 
period (88%) (Extended Data Fig. 6), with the temperature trend ranging 
from -1.40 °C per decade (that is, cooling) to 0.93 °C per decade. 
Most of these 49 lakes also experienced an increase in annual 
precipitation (61%, ranging from -49 to 173 mm per decade) while close 
to half of the lakes experienced an increase in fertilizer application 
rate (49%, ranging from -1.47 to 2.41 Mg N km⁻² per decade) 
(Supplementary Table 1). 


Categorization of lakes by prototypical historical pathway 

To bin the lakes by prototypical historical pathway, we fit a linear 
model with time for each lake time series using ordinary least squares 
regression: 


$y = \beta_1 t + \beta_0$ (5) 


where $y$ represents the normalized maximum summertime bloom 
intensity, $t$ represents the year of the observation, and $\beta_0$ and $\beta_1$ are 
the fitted model parameters. Bloom intensity values from individual 
lake pixels (B from Eq. (2)) were summed for each image composite, 
and the maximum summed bloom intensity for each year was used to 
create the time series $y$ after correcting for missing data, subtracting 
the long-term mean and dividing by the long-term s.d. Lakes for which 
the linear term was statistically significant (P < 0.1) were categorized 
as sustained improvement if the peak bloom intensity trend decreased 
with time ($\beta_1 < 0$) and deterioration if the peak bloom intensity trend 
increased with time ($\beta_1 > 0$). 

For the remaining lakes, we fit a quadratic model to each lake time 
series: 


$y = \beta_2 t^2 + \beta_1 t + \beta_0$ (6) 


where a third term is added indicating a change in peak bloom 
intensity with $t^2$. Lakes for which the quadratic term was statistically 
significant (P < 0.1) and $\beta_2 > 0$ were categorized as improvement then 
deterioration. The remaining lakes were categorized as no significant 
trend. 
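
The decision rules built on Eq. (5) and Eq. (6) can be summarized in a short sketch. It takes the fitted coefficients and their P values as inputs rather than performing the regressions; the function and argument names are hypothetical.

```python
def categorize_pathway(linear_coef, p_linear, quad_coef, p_quadratic, alpha=0.1):
    """Assign a lake to a historical pathway category.

    linear_coef / p_linear: slope of the linear model (Eq. (5)) and its P value.
    quad_coef / p_quadratic: coefficient of the squared term in the quadratic
    model (Eq. (6)) and its P value. All argument names are hypothetical.
    """
    if p_linear < alpha:
        # A significant linear term decides the category on its own.
        return "sustained improvement" if linear_coef < 0 else "deterioration"
    if p_quadratic < alpha and quad_coef > 0:
        # Upward-opening curve: intensity falls first, then rises.
        return "improvement then deterioration"
    return "no significant trend"
```

Under these rules a lake with a significant but negative squared-term coefficient falls into the no significant trend category, as the text assigns all remaining lakes to that group.
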

This approach used both simple monotonic trends with time as well 
as an assessment of the degree of improvement and deterioration to 
categorize bloom intensity trends. Categorization of lakes (eutrophica- 
tion, restoration or no change) on the basis of simple changes in lake 
parameters (increasing, decreasing or no consistent change, respec- 
tively) has been used previously to understand long-term trends°, as 
have comparisons of multiple measurements over time to assess the 
balance between historical deterioration and improvement”. 


Data availability 

The Landsat 5 Thematic Mapper imagery used in this study is available 
from the US Geological Survey (http://earthexplorer.usgs.gov) and 
through Google Earth Engine (https://earthengine.google.com). The 
bloom intensity trend estimates, historical pathway categories and 
environmental driver variables generated for each lake and analysed 
in this study are provided in Supplementary Table 1. The temperature, 
precipitation, fertilizer use and lake geomorphological data supporting 
the findings of this study are publicly available” (see ‘Environmental 
driver, watershed, and geomorphological characteristic data sets’ in 
the Supplementary Information). 
Code availability 


Google Earth Engine’s web interface allows the bloom detection 
algorithm7' to be applied to any Landsat 5 Thematic Mapper images. Access 
will be provided upon request. 
there is evidence of major ecological changes. The y axis shows temporal trend values in each sub-period separately. 
Extended Data Fig. 3 | No relationship is observed between bloom intensity 
and environmental factors collected from all lakes. a–c, Scatter plots of 
bloom intensity z-score compared with temperature (a; n = 784), precipitation 
(b; n = 936) and fertilizer (c; n = 980) z-scores. Each circle represents one year for 
one lake. The z-score of each lake variable is calculated using the mean and s.d. of 
its own time series. Red lines indicate the linear fit of the white circles. d, Box 
plots of bloom intensity z-score (n = 980 total). Each box plot shows the 
distribution of z-scores for all lakes with available data each year. Each box 
extends from the first to the third quartile values, with a line at the median. The 
whiskers extend to 1.5× the interquartile range from the edges of the box. The 
plus symbols show outlier values past the end of the whiskers. 
Extended Data Fig. 4 | Availability of bloom intensity data during the study period. Number of lakes with a bloom intensity observation after correction for 
clouds and number of composites (see Methods) divided by the total number of study lakes (n = 71) for each year. 
Extended Data Fig. 5 | Distributions of lake variables by historical pathway. a–c, Distributions of environmental drivers. d–i, Distributions of geomorphological 
factors. The data in a are equivalent to those in Fig. 3. 
Extended Data Fig. 6 | Global distribution of trends in lake temperature. For the lakes with at least 14 years of bloom data (n = 49), the maps show the temporal 
trend in lake surface water temperature (°C per decade). The base map was generated using Generic Mapping Tools”. 
Extended Data Fig. 7 | Spectral reflectance curves used in simulations to test 
algorithm sensitivity. Spectral reflectance curves (p,,[-]) associated with three 
phytoplankton bloom concentrations and one non-bloom water condition 
measured in Lake Erie are shown**. Blooms I, II and III correspond to near-surface 
chlorophyll-a concentrations of 100.1 mg m⁻³, 143.7 mg m⁻³ and 106.3 mg m⁻³, 
respectively, and total suspended solid concentrations of 30.1 g m⁻³, 20.0 g m⁻³ 
and 22.7 g m⁻³, respectively. The non-bloom curve corresponds to chlorophyll-a 
and total suspended solid concentrations of 5.8 mg m⁻³ and 1.8 g m⁻³, 
respectively. The normalized spectral response of the near-infrared channel of 
L5 TM is also shown. The spectra used in the sensitivity analyses demonstrate 
the robustness of the bloom intensity measure used in this study (Eq. (2)). 
Extended Data Fig. 9 | Historical bloom intensity patterns for four 
additional lakes with well documented temporal trends. Graphs as in Fig. 4 
for four additional lakes that had well documented temporal trends. Panels show 
five-year moving averages of normalized bloom intensity, summer lake 
temperatures, and total precipitation and fertilizer application rate over each 
lake’s watershed. Thicker temperature, precipitation and fertilizer lines indicate 
that the Pearson correlation coefficient with bloom intensity is significant 
(P < 0.1). Dashed lines indicate anti-correlations. 
Recent reports of local extinctions of arthropod species’, and of massive declines in 
arthropod biomass’, point to land-use intensification as a major driver of decreasing 
biodiversity. However, to our knowledge, there are no multisite time series of 
arthropod occurrences across gradients of land-use intensity with which to confirm 
causal relationships. Moreover, it remains unclear which land-use types and arthropod 
groups are affected, and whether the observed declines in biomass and diversity are 
linked to one another. Here we analyse data from more than 1 million individual 
arthropods (about 2,700 species), from standardized inventories taken between 2008 
and 2017 at 150 grassland and 140 forest sites in 3 regions of Germany. Overall gamma 
diversity in grasslands and forests decreased over time, indicating loss of species 
across sites and regions. In annually sampled grasslands, biomass, abundance and 
number of species declined by 67%, 78% and 34%, respectively. The decline was 
consistent across trophic levels and mainly affected rare species; its magnitude was 
independent of local land-use intensity. However, sites embedded in landscapes with a 
higher cover of agricultural land showed a stronger temporal decline. In 30 forest sites 
with annual inventories, biomass and species number—but not abundance—decreased 
by 41% and 36%, respectively. This was supported by analyses of all forest sites sampled 
in three-year intervals. The decline affected rare and abundant species, and trends 
differed across trophic levels. Our results show that there are widespread declines in 
arthropod biomass, abundance and the number of species across trophic levels. 
Arthropod declines in forests demonstrate that loss is not restricted to open habitats. 
Our results suggest that major drivers of arthropod decline act at larger spatial scales, 
and are (at least for grasslands) associated with agriculture at the landscape level. This 
implies that policies need to address the landscape scale to mitigate the negative 
effects of land-use practices. 


Much of the debate surrounding the human-induced biodiversity crisis 
has focused on vertebrates’, but population declines and extinctions 
may be even more substantial in small organisms such as terrestrial 
arthropods‘. Recent studies have reported declines in the biomass of 
flying insects’, and in the diversity of insect pollinators>*, butterflies 
and moths?” ”°, hemipterans””” and beetles’”*. Owing to the associated 
negative effects on food webs”, ecosystem functioning and ecosystem 
services", this insect loss has spurred an intense public debate. However, 
time-series data relating to arthropods are limited, and studies have 
so far focused on a small range of taxa"”*"*, a few types of land use and 
habitat’”, or even on single sites’. In addition, many studies lack species 
information’ or high temporal resolution””. It therefore remains unclear 
whether reported declines in arthropods are a general phenomenon 
that is driven by similar mechanisms across land-use types, taxa and 
functional groups. 

The reported declines are suspected to be caused mainly by human 
land use’. Locally, farming practices can affect arthropods directly by 
application of insecticides*”, mowing” or soil disturbance, or indirectly 
via changes in plant communities through the application of herbi- 
cides or fertilizer”. Forestry practices can also affect local arthropod 
communities via changes in tree species composition or forest 
structure”. In addition, local arthropod populations can be affected by land 
use in the surrounding landscape; for example, through the drift and 
transport of pesticides and nitrogen by air or water?>™, through the 
effects of habitat loss on meta-communities (source-sink dynamics”) 
or by hampering dispersal. 

To disentangle the local and landscape-level effects of land use on 
temporal trends in arthropod communities of grasslands and forests, 
we used data from the ‘Biodiversity Exploratories’ research programme 
that pertain to more than 1 million individual arthropods (2,675 species) 
(Extended Data Table 1). Arthropods were collected annually at 150 grass- 
land sites by standardized sweep-net sampling in June and August from 
2008 to 2017, and at 30 forest sites with flight-interception traps over 
the whole growing period from 2008 to 2016. An additional 110 forest 
sites were sampled in 2008, 2011 and 2014 to test for trends across a 
larger number of sites. Both the grassland and the forest sites cover 
gradients in local land-use intensity. Land-use intensity was quantified 
in the form of compound indices that are based on grazing, mowing and 
fertilization intensity in grasslands”, and on recent biomass removal, the 
proportion of non-natural tree species and deadwood origin in forests”. 
To analyse landscape-level effects, we quantified the cover of arable 
fields, grassland and forest in circles, with a radius between 250 m and 
2 km, around each sampling site. We modelled temporal trends in arthropod 
biomass (estimated from body size; Methods), abundance and the 
number of species separately for grasslands and forests, and tested 
for the effects of local and landscape-scale land-use intensity on these 
trends, accounting for weather conditions. Analyses were conducted 
for all species together, and for different dispersal and trophic guilds. 

The total number of arthropod species across all sites (gamma diver- 
sity) was substantially lower in later than in earlier years in both forests 
and grasslands (Fig. 1). Gamma diversity, biomass, abundance and num- 
ber of species fluctuated over time but revealed an overall decrease 
with strongest declines from 2008 to 2010, especially in grasslands 
(Fig. 1). Year-to-year fluctuations in arthropod biomass, abundance 
and number of species were partially explained by weather conditions 
(Extended Data Fig. 1, Supplementary Table 1-1, Supplementary Informa- 
tion section 2). Accounting for weather, fitted trends from our models 
showed declines in biomass of 67% for grasslands and 41% for forests, 
declines in species numbers of 34% for grasslands and 36% for forests, 
and declines in abundance of 78% for grasslands, with no significant 
change in abundances for forests (-17%) (Fig. 1, Supplementary Table 3-1). 
In grasslands, declines occurred consistently across all trophic guilds 
(herbivores, myceto-detritivores, omnivores and carnivores), although 
the trend for carnivores was not significant (Supplementary Table 1-1). In 
forests, the patterns were more complex: herbivores showed an increase 
in abundance and species number, whereas all other trophic guilds 
declined. Temporal trends of arthropods on the basis of data recorded in 
3-year intervals from all 140 forest sites were similar to the trends based 
on the 30 sites with annual data (Supplementary Table 1-1). Sensitivity 
analyses that removed or reshuffled years showed that the decline was 
influenced by, but not solely dependent on, high numbers of arthropods 
in 2008. Fluctuations in numbers (including the numbers from 2008) 
appear to match trends that have been observed in other studies”, which 
suggests that the recent decline is part of a longer-term trend that had 
begun by at least the early 1990s (Extended Data Fig. 2, Supplementary 
Information section 3). Further sensitivity analyses showed consistent 
declines when data from individual sampling dates were not aggregated 
per year, and also showed that declines concerned all three regions that 
we analysed (Supplementary Tables 3-2, 3-3, Supplementary Fig. 3-1). 

Fig. 1 | Temporal trends in arthropod communities. a–d, Gamma diversity 
(total number of species across all grassland or forest sites) (a), biomass (b), 
abundance (c) and number of species (d) of arthropods were recorded in 30 
forest and 150 grassland sites across Germany. Gamma diversity shows mean 
incidence-based, bias-corrected diversity estimates (Chao’s BSS, that is, the 
higher value of the minimum doubled reference sample size and the maximum 
reference sample size among years”’) for q = 0 and 95% confidence intervals 
derived from bootstrapping (n = 200). Non-overlapping confidence intervals 
indicate significant difference*°. Box plots show raw data per site and year 
(n = 1,406 (grassland) or 266 (forest) independent samples). Solid lines indicate 
significant temporal trends (P < 0.05) based on linear mixed models that 
included weather conditions, and local and landscape-level land-use intensity as 
covariates. Shaded areas represent confidence intervals. Boxes represent data 
within the 25th and 75th percentile, black lines show medians, and whiskers show 
1.5× the interquartile range. Data points beyond that range (outliers) are not 
shown for graphical reasons. Plots for biomass and species number have 
separate y axes for grassland and forest. 

Linking changes in biomass, abundance and the number of species 
to one another enables further inferences regarding the mechanisms 
that drive arthropod declines. In grasslands, both abundant and 
less-abundant species declined in abundance (Fig. 2), but loss in the number 
of species occurred mostly among less-frequent species (Fig. 1, Extended 
Data Fig. 3, Supplementary Information section 4). This suggests that the 
decline in the number of species in grasslands was attributable mainly 
to a loss of individuals among rare species. In forests, species that were 
initially less abundant decreased in abundance, whereas some of the 
most abundant species—including invasive species and potential pest 
species—increased in abundance (Fig. 2, Supplementary Table 5-1). The 
loss of species was, however, irrespective of their frequency (Fig. 1, 
Extended Data Fig. 3, Supplementary Information section 4). This sug- 
gests that the decline of arthropods in forests is driven by mechanisms 
that negatively affect the abundances of many species, which leads to an 
overall decline in biomass and the number of species but favours some 
species that are able to compensate for declines in abundance. 

The magnitudes of declines in biomass, abundance and the number of 
species in arthropod communities were independent of local land-use 
intensity (Supplementary Table 1-1) as well as changes in plant commu- 
nities (Supplementary Information section 6) at all sites. However, in 
forests, declines in the number of species were weaker at sites with high 
natural or anthropogenic tree mortality, possibly owing to increased 
heterogeneity in local habitats (Extended Data Fig. 4). Landscape com- 
position had no effect on arthropod trends in forests (note that forest 
sites covered only limited gradients of the landscape variables, Extended 
Data Fig. 5), but it mediated declines in the number of species in grass- 
lands: the magnitude of the declines increased with increasing cover of 
arable fields, and marginally increased with cover of grasslands in the 
surrounding landscape (Fig. 3, Supplementary Table 1-1). This suggests 
that major drivers of arthropod decline in grasslands are associated with 
agricultural land use at the landscape scale. 

Fig. 2 | Changes in the dominance of species. Rank abundance curves of 
arthropod communities for the first two (2008-2009) and final two (2016-2017 
for grasslands and 2015-2016 for forests) years of the study, from 150 grassland 
and 30 forest sites. The insets show enlarged curves for the 30 most-abundant 
species. Data from the first two and final two study years were pooled 

The interaction between a species and the landscape around its 
habitat depends on its dispersal ability, which ultimately determines 
its occurrence and persistence”. In grasslands, taxa of high and low 
dispersal ability (Methods) both declined, but an increasing cover of 
arable fields—although not of grasslands—in the surroundings ampli- 
fied declines in the biomass of weak dispersers more strongly than it 
did declines of strong dispersers (Fig. 3, Supplementary Table 7-1). 
Weak dispersers may experience higher mortality during dispersal, 
and thus have a lower chance of (re)colonization of a particular site 
when arable field cover is high. In forests, strong dispersers declined 
in biomass, abundance and the number of species, whereas weak dis- 
persers increased in abundance and biomass—but less strongly when 
grassland cover in the landscape was high (Supplementary Table 7-1). 
This suggests that the drivers behind arthropod declines in forests also 
act at landscape-level spatial scales. 

Fig. 3 | Landscape effects on arthropod decline in grasslands. a, Temporal 
changes in biomass, abundance and the number of species for all arthropod 
species. b,c, Temporal change in biomass of species with high (b) or low (c) 
dispersal ability, conditional on the cover of arable fields in the surrounding 
landscape (1-km radius). The decline in biomass increased significantly with the 
cover of arable fields for weak dispersers, but not for strong dispersers. Slopes 
were derived from models that included weather conditions and local land-use 
intensity as covariates. The y axes are log-scaled, but show untransformed values. 

We showed that arthropods declined markedly not only in biomass 
but also in abundance and the number of species, and that this affected 
taxa of most trophic levels in both grasslands and forests. Declines in 
gamma diversity suggest that species might disappear across regions. 
Our results also indicate that the major drivers of arthropod decline in 
both habitat types act at landscape-level spatial scales, but that declines 
may be moderated by increases in heterogeneity of local habitats in 
forests. Although the drivers of arthropod decline in forests remain 
unclear, in grasslands these drivers are associated with the proportion 
of agricultural land in the landscape. However, we cannot ascertain 
whether the observed declines are driven by the legacy effects of his- 
torical land-use intensification or by recent agricultural intensification 
at the landscape level; for example, by the decrease of fallow land and 
field margins rich in plant species, the increased use of pesticides or use 
of more potent insecticides (Supplementary Information section 3). 
Time-series data relating to changes in the use of agrochemicals or the 
presence of fine-scale arthropod habitats would be necessary to answer 
this question. Furthermore, the extent to which changes in climate have 
reinforced the observed trends in arthropod biomass, abundance and 
number of species is unclear (Supplementary Information section 2). 
Our results show that widespread arthropod declines have occurred in 
recent years. Although declines were less pronounced during the second 
half of our study period, there is no indication that negative trends have 
been reversed by measures that have been implemented in recent years. 
This calls for a paradigm shift in land-use policy at national and interna- 
tional levels to counteract species decline in open and forested habitats 
by implementing measures that are coordinated across landscapes 
and regions. Such strategies should aim to improve habitat quality for 
arthropods and to mitigate the negative effects of land-use practices 
not only at a local scale (within isolated patches embedded in an 
inhospitable agricultural matrix) but also across large and continuous areas. 
Methods


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Study system, land-use measures and weather data 

The study was conducted as part of the Biodiversity Exploratories pro- 
ject (www.biodiversity-exploratories.de) in three regions of Germany: 
(1) Schwäbische Alb in southwestern Germany (460-860 m above sea
level (asl)); (2) Hainich-Dün in central Germany (285-550 m asl); and (3)
Schorfheide-Chorin in northeastern Germany (3-140 m asl). The three
regions differ in climate, geology and topography, but each is character- 
ized by gradients of land-use intensity in grasslands and forests that are 
typical for large parts of temperate Europe”. 

A total of 150 grassland sites of 50 × 50 m in size (50 per region) and 140
forest sites of 100 × 100 m in size (49 in Schwäbische Alb, 50 in Hainich-Dün
and 41 in Schorfheide-Chorin), located within larger management units,
were selected from a total of about 3,000 candidate sites by stratified
random sampling to ensure that the selected sites covered the whole
range of land-use intensity and to minimize the confounding effects of
spatial position or soil type*. All sites have a long history of the same
type of land-use and of broadly similar land-use intensity. Land use is 
conducted by landowners or tenant farmers (rather than by the scien- 
tific consortium) and the start of the project did not cause changes in 
land use. Local land-use intensity decreased significantly in forests and 
marginally in grasslands over the course of our study (Supplementary 
Information section 8). No pesticides were applied at any of the sites, 
except for the application of herbicides in grasslands on five occasions (site
number and year: AEG2 2011, HEG2 2013, HEG36 2014, HEG37 2014 and
HEG1 2015).

In grasslands, the gradient of land-use intensity ranged from semi- 
natural to intensively managed grasslands. Natural grasslands, which 
do not require management to prevent succession to forest, are almost
entirely absent from western and central Europe. All sites were con- 
tinuously managed by farmers. Information on management practices, 
including the level of fertilization (kg N ha⁻¹ year⁻¹), grazing (number
of livestock units ha⁻¹ year⁻¹) and mowing (number of cuts year⁻¹), was
assessed annually by standardized interviews with the farmers. Local 
land-use intensity in grasslands was then quantified as a compound
index by summing the standardized intensities (that is, divided by the 
global mean value) of these three components”. We then calculated 
the mean local land-use intensity for each site over the ten years of our 
study (2008-2017). The least intensively managed grasslands are often 
located within protected areas (n = 47 sites, including 15 sites in strictly
protected areas) and are typically grazed by 40-50 sheep per hectare 
for about 10 days per year (or more rarely by 1-3 cattle per hectare for 
20 days), unfertilized and not mown. Grasslands of intermediate land- 
use intensity are usually unfertilized (or fertilized with less than 30 kg 
N ha⁻¹ year⁻¹), and are either mown twice a year or grazed by 4 cattle per
hectare for about 50 days. The most intensively managed grasslands 
are typically fertilized (60-120 kg N ha⁻¹ year⁻¹) and either mown 2 or
3 (maximum of 4) times a year, grazed by 5-10 cattle per hectare for 
100-150 days, or both mown and grazed. 
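
The compound index for grasslands can be illustrated with a minimal Python sketch. It assumes that each component is standardized by dividing it by its mean across all sites and that the three standardized values are then summed per site, as the text describes. The function name and the example values are hypothetical and are not taken from the study.

```python
from statistics import mean

def land_use_intensity(fertilization, grazing, mowing):
    """Return one compound index value per site.

    Each argument is a list with one value per site (kg N per ha per year,
    livestock units per ha per year, cuts per year). Every component is
    divided by its mean over all sites, then the three are summed per site.
    """
    components = (fertilization, grazing, mowing)
    means = [mean(c) for c in components]
    if any(m == 0 for m in means):
        raise ValueError("a component with a zero global mean cannot be standardized")
    n_sites = len(fertilization)
    return [sum(c[i] / m for c, m in zip(components, means)) for i in range(n_sites)]

# Hypothetical values for three sites (illustration only).
lui = land_use_intensity([0.0, 30.0, 90.0], [100.0, 50.0, 0.0], [0.0, 2.0, 4.0])
```

Because each standardized component has a mean of 1 across sites, the index averages 3 over all sites in this formulation.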

In forests, the gradient of land-use intensity included three broad 
categories: unmanaged broadleaf, managed broadleaf and managed 
conifer forest. The least intensively managed forests have been managed 
to some degree in the past, but are now often located within protected 
areas (n=56 sites, including 31 sites in strictly protected areas; 14 and 9, 
respectively, of these sites have annual arthropod data). The naturally 
dominant tree species at all sites is European beech (Fagus sylvatica). The
dominating conifer species are Norway spruce (Picea abies) and Scots 
pine (Pinus sylvestris), which are native to central Europe but would be 
absent or rare in the study regions under natural conditions. However, 
these trees have been cultivated in the study regions for approximately
250 years. On all sites, inventories of living trees, stumps and deadwood
were conducted between 2009 and 2011”. To obtain a continuous fine- 
grained measure for local land-use intensity in forests, we calculated 
a compound index on the basis of three indicators scaled 0-1: recent
biomass removal (volume of harvested timber divided by the sum of 
the volume of living trees, harvested timber and remaining deadwood), 
proportion of tree species that do not belong to the native vegetation 
community (volume of standing timber, harvested timber and dead- 
wood of non-native tree species (including spruce and pine) divided by 
the sum of the volume of all tree species) and deadwood origin (volume
of deadwood with saw cuts divided by the total volume of deadwood)”. 
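
As a rough illustration, the three forest indicators can be written as simple volume ratios. The sketch below is not the authors' implementation. It assumes that the denominator for the non-native proportion is the same total volume (living trees, harvested timber and deadwood) used for biomass removal, and that the compound index is the plain sum of the three indicators, which the text does not state explicitly. All names and values are hypothetical.

```python
def forest_indicators(living, harvested, deadwood, non_native, deadwood_saw_cut):
    """Return the three 0-1 indicators for one site.

    All arguments are timber volumes in the same unit. non_native is the
    combined volume of standing timber, harvested timber and deadwood of
    non-native species; deadwood_saw_cut is the deadwood volume with saw cuts.
    """
    total = living + harvested + deadwood
    if total <= 0:
        raise ValueError("total timber volume must be positive")
    return {
        "biomass_removal": harvested / total,
        "non_native": non_native / total,
        # With no deadwood at all, the origin indicator is set to 0 (assumption).
        "deadwood_origin": deadwood_saw_cut / deadwood if deadwood > 0 else 0.0,
    }

def compound_index(indicators):
    # Assumption: the compound index is the plain sum of the three indicators.
    return sum(indicators.values())

# Hypothetical volumes for one site (illustration only).
site = forest_indicators(living=160.0, harvested=20.0, deadwood=20.0,
                         non_native=50.0, deadwood_saw_cut=5.0)
index = compound_index(site)
```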

Land-use intensity at the landscape scale was quantified by meas- 
uring the proportion of area covered by arable fields, grasslands and 
forests within circular areas around the centre of our sites. As the scale 
of effect was unknown, we considered different area sizes with radii 250, 
500, 1,000, 1,500 and 2,000 m. We used vector land-cover data from 
ATKIS Basis DLM (license agreements: GeoBasis-DE/LGB 2017, BG-D 
29/17) with ±3 m accuracy of polygon borders, representing conditions
between 2008 and 2010. We are aware that land cover is only a coarse 
measure for land-use intensity at the landscape scale, but information 
on fine-scaled habitat availability (such as field margins and within-field 
plant diversity) or details on land-use practices (such as the amount of 
fertilizers and pesticides used) was not available. 

Air temperature was recorded at all 290 sites with hourly resolution 
starting between early 2008 and early 2009, depending on the site. 
Gaps within the time series at individual stations were filled on the basis 
of average linear relationships with neighbouring stations within the 
three regions. To derive complete time series from winter 2007-2008 
onward, the initial time span was filled on the basis of data from the 
station network of the German Weather Service that surrounds each 
Exploratory (five stations each). Using 10x space-time cross-validation 
and a forward feature selection approach, the best individual subset of 
the 5 surrounding stations for each of the 290 stations to be filled was 
identified and a multiple linear model was used to predict the missing 
values. Precipitation for each site was derived from the RADOLAN product
of the German Weather Service (hourly radar-based precipitation
estimates corrected by gauge measurements, with a resolution of 1 km²
and 0.1 mm/h). From these products, we calculated—for each region 
and year—the mean temperature, number of frost days (daily minimum 
temperature <0 °C), number of warm days (daily mean temperature
>20 °C) and precipitation sum for winter (from November of the previ- 
ous year to February), growing period (from March to October) and 
year (from November of the previous year to October). Gap filling at 
the start of the time series was conducted in R version 3.5.1. Other 
computations used the climate-processing software TubeDB
(https://environmentalinformatics-marburg.github.io/tubedb).
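
The seasonal weather summaries described above can be sketched in a few lines of Python. This is only an illustration of the definitions (frost day: daily minimum below 0 °C; warm day: daily mean above 20 °C) and does not reproduce the R or TubeDB workflow. The record layout and key names are hypothetical.

```python
from statistics import mean

def summarize_period(daily_records):
    """Summarize daily weather records for one period (for example, winter).

    daily_records: list of dicts with keys 't_min' and 't_mean' (degrees C)
    and 'precip' (mm). The key names are hypothetical.
    """
    return {
        "t_mean": mean(r["t_mean"] for r in daily_records),
        "n_frost": sum(1 for r in daily_records if r["t_min"] < 0.0),
        "n_warm": sum(1 for r in daily_records if r["t_mean"] > 20.0),
        "precip_sum": sum(r["precip"] for r in daily_records),
    }

# Three made-up days (illustration only).
days = [
    {"t_min": -2.0, "t_mean": 1.0, "precip": 0.0},
    {"t_min": 0.5, "t_mean": 4.0, "precip": 3.2},
    {"t_min": 15.0, "t_mean": 21.0, "precip": 1.3},
]
summary = summarize_period(days)
```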


Arthropod sampling 

Arthropods were sampled annually in a consistent and standardized 
way from all 150 grassland sites from 2008 to 2017 and from 30 for- 
est sites from 2008 to 2016. On the other 110 forest sites, arthropods 
were sampled by the same method and with the same sampling effort 
in 2008, 2011 and 2014. In grasslands, all arthropods of the herb layer 
were sampled twice per year in June and August to represent different
phenological windows within the peak season of adult arthropod activity.
On the basis of monthly samplings at the beginning of the study, we
identified these two months as representing the best trade-off between 
reducing sampling effort and covering most species. Arthropods were 
sampled by sweep netting along a 150-m-long transect that comprised 3
of the virtual borders of a site by conducting 60 double sweeps per site”. 
Sweep netting was conducted only on days without rain, with low wind 
speed and after morning dew had dried. To reduce potential observer 
bias, personnel were trained and changes in personnel were reduced 
as much as possible (one change in Schwäbische Alb and Hainich-Dün
regions in 2009; two changes in Schorfheide-Chorin region in 2009 and
2010; Supplementary Information section 3). In forests, flying insects
were sampled using two flight-interception traps per site located close 
to two randomly selected corners of each site. Traps consisted of two 
crossed transparent plastic shields (40 x 60 cm) with funnels opening 
into sampling jars below and above the shields that were filled with 3% 
CuSO₄ solution and a drop of detergent”. Traps operated from March to
October and were emptied monthly. All sites are part of larger management
units, and therefore no edge effects owing to changes in land-use
intensity at site borders are to be expected. 

All samples were sorted to the order level in the laboratory. For taxonomic
groups that occurred in larger numbers, and for which expert
taxonomists were available, adult specimens were identified at species 
level: for grasslands, these included species in the Araneae, Coleoptera, 
Hemiptera (both Heteroptera and Auchenorrhyncha; some hemipterans 
were classified only to family or subfamily level) and Orthoptera, and 
for forests, these included species in the Coleoptera and Hemiptera (in 
Heteroptera). Only very few adults in these taxonomic groups could not 
be identified to the species level (1.1% in grasslands and 0.7% in forests), 
and these were excluded from the analyses. In grasslands, we additionally 
counted the number of individuals per order for groups for which no tax- 
onomists were available: these included Acarina, Blattodea, Collembola, 
Dermaptera, Diptera (which was divided into Brachycera and Nema- 
tocera), Ephemeroptera, Hemiptera (Psyllidae and Aphidoidea), Hyme- 
noptera (divided into Apocrita, Symphyta and Formicidae), Isopoda, 
Lepidoptera, Mecoptera, Megaloptera, Neuroptera, Odonata, Opiliones, 
Plecoptera, Pseudoscorpiones, Psocoptera, Raphidoptera, Strepsiptera, 
Thysanoptera and Trichoptera. Information on body length, trophic 
level and dispersal ability for identified species was obtained from the 
literature***>. We estimated the biomass of all arthropod specimens that 
were identified to species level by applying a previously developed*® 
general power function: biomass (in g) = 0.305 × L^2.62/1,000, where L is the
mean body length of a species in millimetres. All arthropods identified
to the species level were assigned to one of four trophic groups (her- 
bivores, myceto-detritivores, carnivores and omnivores) on the basis 
of their known main food resource as adults. Because typical dispersal 
distances are unknown for most arthropod species, we classified species 
according to morphological characteristics and behavioural traits within 
taxonomic groups (for example, wing development, ballooning or hunting
strategy)*. Dispersal ability (ranging from 0 to 1 in steps of 0.25) was
defined differently for the groups, considering wing dimorphism (for 
Hemiptera, Coleoptera and Orthoptera), flying ability (Coleoptera) as 
well as information on migration and ballooning behaviour (Araneae) 
(details have previously been published). All species with a dispersal 
ability <0.5 were considered to be weak dispersers, and all species with
a dispersal ability >0.5 were considered to be strong dispersers.
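
The biomass estimate from body length can be sketched as follows. The coefficient 0.305 and the division by 1,000 follow the text. The default exponent of 2.62 is an assumption taken from the commonly used general length-weight relationship for insects and should be checked against the cited source. Function names, species labels and counts are hypothetical.

```python
def biomass_g(body_length_mm, coefficient=0.305, exponent=2.62):
    """Estimated biomass (g) of one individual from mean body length (mm).

    The default exponent is an assumption (see the lead-in), not a value
    confirmed by this text.
    """
    if body_length_mm <= 0:
        raise ValueError("body length must be positive")
    return coefficient * body_length_mm ** exponent / 1000.0

def sample_biomass_g(counts, body_lengths_mm):
    """Total biomass of a sample: individuals per species times per-capita biomass."""
    return sum(n * biomass_g(body_lengths_mm[sp]) for sp, n in counts.items())

# Hypothetical sample with two species (illustration only).
counts = {"species_a": 3, "species_b": 1}
lengths = {"species_a": 1.0, "species_b": 2.0}
total = sample_biomass_g(counts, lengths)
```

Because the relationship is a power function with an exponent above 2, a doubling of body length increases the estimated biomass more than fourfold, so a few large species can dominate the biomass of a sample.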


Vegetation sampling 

Plant communities in all 150 grassland sites were recorded in an area 
of 4 × 4 m between mid-May and mid-June from 2008 to 2017, and in
30 forest sites in an area of 20 × 20 m twice a year (spring and summer)
from 2009 to 2016, by estimating the cover of each species. Ellenberg
indicator values were taken from a previous publication”.


Statistical analyses 
All analyses were conducted in R v.3.5.1”.

We performed principal component analyses (PCAs) and pairwise 
correlation tests including all weather variables. On the basis of the 
results, and similar to a previous publication’, we selected mean winter 
temperature and precipitation during the growing period for subse- 
quent analyses because these variables were weakly correlated and 
represented both seasons and both temperature and precipitation 
(Extended Data Fig. 6). Despite considerable temporal fluctuations, 
mean winter temperature increased and precipitation during the grow- 
ing period decreased within our study period (Extended Data Fig. 7). 
PCAs and pairwise correlation tests for landscape-level variables enabled
us to select cover of arable fields and cover of grassland as independent
predictors because these were weakly correlated across spatial scales, 
whereas forest cover was correlated to both cover of arable fields and 
grassland (Extended Data Fig. 6). 

We calculated gamma diversity (estimated total number of species) 
across 150 grassland and 30 forest sites separately for grassland and 
forest for each year using the ‘diversity accumulation curve’ frame- 
work that extends methods for rarefaction and extrapolation of species 
richness”’. We used Chao’s BSS based on the frequencies of species: 
the higher value of the minimum doubled reference sample size, and 
the maximum reference sample size among years as incidence-based, 
bias-corrected diversity estimates for q = 0, 1 and 2”. This approach
accounts for slight differences in site numbers between years caused
by limited accessibility or failure of traps. With increasing order q, the
more frequent species are more strongly weighted (q = 0 equals species
richness, q = 1 equals the exponential of Shannon entropy and q = 2 equals
the inverse of Simpson diversity), which enables us to assess whether
changes in gamma diversity depend on the frequencies of species. Using 
different reference-sample sizes resulted in consistent results (data not 
shown). Confidence intervals were calculated by bootstrapping (n=200 
bootstraps). 
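
To make the role of the order q concrete, the following Python sketch computes plain empirical Hill numbers from species frequencies. It is a simplified illustration only: it omits the bias correction, rarefaction, extrapolation and bootstrapping used in the study, and the function name and example frequencies are hypothetical.

```python
import math

def hill_number(frequencies, q):
    """Effective number of species of order q from species frequencies.

    q = 0 gives species richness, q = 1 the exponential of Shannon entropy
    and q = 2 the inverse of the Simpson concentration.
    """
    total = sum(frequencies)
    p = [f / total for f in frequencies if f > 0]
    if q == 1:
        # Limit of the general formula as q approaches 1.
        return math.exp(-sum(pi * math.log(pi) for pi in p))
    return sum(pi ** q for pi in p) ** (1.0 / (1.0 - q))

# Hypothetical frequencies: one common and two rare species.
uneven = [8, 1, 1]
d0, d1, d2 = (hill_number(uneven, q) for q in (0, 1, 2))
```

For a perfectly even community all three orders return the number of species; the more uneven the frequencies, the further the values for q = 1 and q = 2 fall below species richness.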

We aggregated data from all arthropods identified to species per site 
and year to calculate biomass, abundance and the number of species 
for all species, and separately for each trophic and both of the dispersal 
groups. For grasslands, we additionally calculated the abundance of all
arthropods per site and year, including groups that were not identified 
to species level. To identify the scale of the effect for landscape-level 
land-use intensity®®, we conducted a multiscale analysis by correlating 
arthropod biomass, abundance and the number of species with the cover 
of arable fields and cover of grassland separately for radii of 250-2,000 
m. For this, only data from a random subset of sites with non-overlapping 
buffers at the 2,000-m scale were used, and this procedure was repeated 
100 times. In grasslands, correlations increased initially with increasing 
radius but started to plateau at 1,000 m (Extended Data Fig. 8). Owing 
to the higher overlap of buffers of neighbouring sites at larger spatial 
scales, we thus present results for all grassland analyses at the 1,000-m 
scale. In forests, the patterns were more complex, but—because of the 
small range of agricultural land-use variables at small scales (Extended 
Data Fig. 5) and the higher overlap of buffers of neighbouring sites at 
larger spatial scales—we also present the results for all forest analyses 
at the 1,000-m scale. 
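
The repeated subsampling of sites with non-overlapping buffers can be sketched as follows. The text does not describe the selection algorithm, so the greedy rule used here (visit sites in random order and keep a site if it lies more than twice the radius from every site already kept) is an assumption. Site coordinates, cover values and abundances are randomly generated placeholders.

```python
import math
import random

def non_overlapping_subset(sites, radius_m, rng):
    """Randomly pick sites whose circular buffers of radius_m do not overlap.

    sites: dict mapping site id -> (x, y) coordinates in metres.
    """
    ids = list(sites)
    rng.shuffle(ids)
    kept = []
    for sid in ids:
        x, y = sites[sid]
        if all(math.hypot(x - sites[k][0], y - sites[k][1]) > 2 * radius_m for k in kept):
            kept.append(sid)
    return kept

def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Placeholder data: 40 sites in a 20 km x 20 km region (illustration only).
rng = random.Random(1)
sites = {f"S{i}": (rng.uniform(0, 20000), rng.uniform(0, 20000)) for i in range(40)}
cover = {sid: rng.random() for sid in sites}
abundance = {sid: rng.random() for sid in sites}

r_values = []
for _ in range(100):  # the procedure is repeated 100 times
    subset = non_overlapping_subset(sites, 2000, rng)
    if len(subset) >= 3:
        r_values.append(pearson([cover[s] for s in subset],
                                [abundance[s] for s in subset]))
```

The mean and s.d. of `r_values` would then be computed for each radius and land-cover variable.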

To test for temporal trends in our arthropod data, we fitted general- 
ized linear mixed models with Poisson errors for count data (abundance 
and species number; function glmer in package lme4) and linear mixed
models with Gaussian errors for biomass (log-transformed; function
lmer), separately for grasslands and forests. For forests, we analysed
the annual data from 30 sites and the 3-year-interval data from 140 sites 
separately. Separate models were fitted for trophic groups. Fixed effects 
included year, weather (mean winter temperature, precipitation during 
the growing period and their interaction), local land-use intensity and 
landscape-level land-use intensity (cover of arable fields and cover of 
grassland within a radius of 1,000 m), as well as interactions between 
year and local land-use intensity and between year and landscape-level 
land-use intensities. Models included the site nested in the region as 
a random effect to account for the nested design and the repeated 
measures at the site level. Poisson models included an observation- 
specific random effect to account for potential overdispersion™. All 
continuous predictor variables were standardized to a mean of 0 and 
an s.d. of 1 before modelling. To test whether changes in the overall 
number of species were associated with changes in overall abundance, 
we ran additional models with the number of species as response and 
log-transformed abundance as covariate. To assess the contribution of 
individual years to the overall trend, we repeated the models for overall 
biomass, abundance and number of species, and excluded data from 
one year each time. In addition, we tested whether the observed effect
of year differed from a random expectation by randomizing the order
of years 100 times for forests and grasslands before modelling.
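
The two robustness checks (excluding one year at a time and randomizing the order of years) can be illustrated with a deliberately simplified Python sketch. It replaces the mixed models of the study with an ordinary least-squares slope of yearly values on year, which is an assumption made only to keep the example self-contained. The data and function names are hypothetical.

```python
import random

def ols_slope(x, y):
    """Ordinary least-squares slope of y on x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    return (sum((a - mx) * (b - my) for a, b in zip(x, y))
            / sum((a - mx) ** 2 for a in x))

def leave_one_year_out(years, values):
    """Slope of values on year after dropping each year in turn."""
    slopes = {}
    for dropped in sorted(set(years)):
        kept = [(yr, v) for yr, v in zip(years, values) if yr != dropped]
        slopes[dropped] = ols_slope([k[0] for k in kept], [k[1] for k in kept])
    return slopes

def permuted_year_slopes(years, values, n_perm, rng):
    """Slopes obtained after randomly relabelling the calendar years."""
    unique = sorted(set(years))
    slopes = []
    for _ in range(n_perm):
        shuffled = unique[:]
        rng.shuffle(shuffled)
        relabel = dict(zip(unique, shuffled))
        slopes.append(ols_slope([relabel[yr] for yr in years], values))
    return slopes

# Hypothetical yearly means with a steady decline (illustration only).
years = list(range(2008, 2018))
values = [100.0 - 5.0 * (yr - 2008) for yr in years]
observed = ols_slope(years, values)
loo = leave_one_year_out(years, values)
null = permuted_year_slopes(years, values, 100, random.Random(42))
```

Comparing `observed` with the distribution in `null` mirrors the comparison between models with correctly and randomly ordered years.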

To test for differences between dispersal groups, we fitted models for 
biomass, abundance and number of species in which effects of year, local 
and landscape-level land-use intensity (as well as their interactions) were 
estimated specifically for each dispersal guild. These models included 
response values for each group per site and year, and dispersal group (weak
or strong) as fixed effect. To test whether observed effects differed sig- 
nificantly between dispersal guilds, we fitted additional models including 
the three three-way interactions between dispersal guild, year and each 
of the three land-use variables. All models included the site nested in the 
region as a random effect to account for spatial arrangement and temporal
repetitions per site. Poisson models included an observation-specific
random effect to account for potential overdispersion. 

In addition to models for data aggregated per site and year, we fitted 
models for biomass, abundance and number of species at the level of 
individual observations (two collections per year for grasslands and 
five collections per year for forests), which could account for seasonal 
differences and weather conditions at the time of sampling. For forest 
data from 30 sites, fixed effects included mean winter temperature, 
mean temperature and precipitation during sampling period, length 
of sampling period (in days), Julian date of the day on which traps were 
emptied, local and landscape-level land-use intensity (cover of arable 
fields and cover of grassland within a radius of 1,000 m), as well as inter- 
actions between year and local land-use intensity, and between year 
and landscape-level land-use intensity. For grasslands, fixed effects 
included mean winter temperature, precipitation during the growing 
season and their interaction, mean temperature and precipitation on 
the day of sampling, Julian date of the day of sampling, local land-use 
intensity and landscape-level land-use intensity (cover of arable fields 
and cover of grassland within a radius of 1,000 m), as well as interac- 
tions between year and local land-use intensity, and between year and 
landscape-level land-use intensity. Models included the site nested in 
the region as a random effect to account for the nested design and the 
repeated measures at the site level. Poisson models included an obser- 
vation-specific random effect to account for potential overdispersion”®. 
To allow nonlinear effects for day of sampling, we fitted generalized 
additive models (function gamm4 in package gamm4). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
This work is based on data from several projects of the Biodiversity
Exploratories. Data used for analyses are publicly available from the
Biodiversity Exploratories Information System
(https://doi.org/10.17616/R32P9Q) at
https://www.bexis.uni-jena.de/PublicData/PublicDataSet.aspx?DatasetId=25786.
Raw data are publicly available from the same repository (with identifiers 
21969, 22007, 22008, 19686 and 20366), or will become publicly available 
after an embargo period of five years from the end of data assembly to 
give the owners and collectors of the data time to perform their analysis. 
Any other relevant data are available from the corresponding author 
upon reasonable request. 
Extended Data Fig. 1 | Effects of weather variables on arthropod numbers.
Effects of mean winter temperature (November to February) and precipitation 
during the growing period (March to October) on biomass, abundance and 
number of species in arthropod communities in 30 forests (orange) and
150 grasslands (blue) across three regions of Germany. Dots represent raw data
aggregated per site and year (n = 1,406 (grassland) or 266 (forest) independent
samples). Dotted lines indicate non-significant (P ≥ 0.05) and solid lines indicate
significant effects of weather variables (P< 0.05), based on linear mixed models 
that included year, local and landscape land-use intensity as covariates. Shaded 
areas represent confidence intervals. The effects of winter temperature and 
precipitation differed between forests and grasslands. In grasslands, arthropod
numbers increased with increasing winter temperature and with increasing 
precipitation in the growing period; the effect of precipitation was weaker than 
the effect of winter temperature, and the effects of both weather variables were 
weaker than the effect of the year (Supplementary Table 1-1). In forests, 
arthropod numbers decreased with increasing winter temperature and with 
increasing precipitation in the growing period; the effects of the two weather 
variables were similarly strong, but slightly weaker than the effect of the year 
(Supplementary Table 1-1). 
Extended Data Fig. 2 | Contribution of individual years to overall trends.
a, To assess the contribution of individual years to the overall trend, we repeated
the linear mixed models for overall biomass, abundance and number of species,
and excluded one year each time. The distribution of t and z values for the effect
of the year from subset models (white), and from the full models including all
years (black), are shown (11 models for grasslands and 10 models for forests).
Grey bars denote the effect of the year 2008 (the year with the strongest effect on
overall trend estimates). b, In addition, we tested whether the observed effect of
year differed from a random expectation by randomizing the order of years 100 times
for forests and grasslands before modelling. The distribution of t and z values for
the effect of the year from models with randomly ordered years (white) and
models with the years ordered correctly (black) are shown (101 models each for
grasslands and forests). Vertical dashed lines indicate levels of significance with
P < 0.05. The results in a show that both weaker and stronger temporal trends
could be detected when single years were excluded from the analysis, compared 
to the full model including all years. Results in b show that models with the years 
ordered randomly produced effects of the year that were normally distributed 
around zero, and only the models with years ordered correctly generated strong 
temporal trends. For a more detailed discussion, see Supplementary 
Information section 3. 
Extended Data Fig. 3 | Declines in gamma diversity of frequent species.
Estimated gamma diversity (total number of species across all grassland or
forest sites) over time. Symbols and error bars show mean and 95% confidence
intervals for gamma diversity, calculated as incidence-based, bias-corrected
diversity estimates (Chao’s BSS”, with 200 bootstrapping runs; Methods) for
q = 1 and 2 (for q = 0, see Fig. 1). With increasing order q, the more-frequent
species are more strongly weighted (q = 0 equals species richness, q = 1 equals
the exponential of Shannon entropy and q = 2 equals the inverse of Simpson
diversity; that is, only dominant species affect the diversity measure). This
approach accounts for slight differences in site numbers between years caused
by limited accessibility or failure of traps. Non-overlapping confidence intervals
indicate a significant difference between two sampling years*’. Figure 1 shows
that gamma diversity declines in both forests and grasslands for q = 0. We find
that in forests gamma diversity declines when only the more-common species
are considered (q = 1 and q = 2), whereas in grasslands there is no overall decline
when only the common species are considered. For a more detailed
interpretation, see Supplementary Information section 4. 
Extended Data Fig. 4 | Effect of tree mortality on arthropod trends. a, The
relative change in the number of arthropod species between the first two and
the final two study years was similar for managed (n = 19) and unmanaged (n = 9)
forest sites (z = 0.648, P = 0.517, derived from a linear mixed model with relative
difference in species number as response, harvesting category as fixed and 
region as random effect). Dots show raw data per site. Boxes represent data 
within the 25th and 75th percentile, black lines show medians, and whiskers show 
1.5x the interquartile range. b, When considering actual tree mortality between 
forest inventories in 2009 and 2016, declines in the number of arthropod species 
were weaker at sites with higher tree mortality (z=2.536, P=0.011, derived from 
a linear mixed model with relative difference in species number as response, tree
mortality as fixed and region as random effect). Dots show raw data per site. The 
blue line visualizes the significant relationship between the change in the 
number of arthropod species and tree mortality based on the linear mixed 
model, and the shaded area represents confidence intervals. This suggests that
changes in habitat conditions and heterogeneity linked to tree mortality (such
as increasing canopy openness, herb cover or deadwood availability)
moderated declines in the number of arthropod species. More research is
needed to identify mechanistic relationships. Tree mortality included both 
natural mortality and timber harvesting. Forest sites had a stand age of, on 
average, 116 years (minimum of 30 years and maximum of 180 years) and 
therefore did not include overmature stands. Owing to stand age and because 
management was abandoned 20 to 70 years before this study started, natural 
tree mortality was low even in unmanaged stands. We expect increasingly 
positive effects of natural tree mortality and associated increased structural 
diversity and heterogeneity* on arthropod trends with increasing stand age, 
but further research is required. In Germany, harvesting is usually conducted as 
shelterwood cutting. In our sites, the harvested amount over the course of our 
study reached a maximum of 1% of the standing volume per year. More intense
harvesting systems (such as clear cutting), which lead to less heterogeneous
habitat conditions, may not have similar moderating effects on arthropod 
declines. 
Extended Data Fig. 5 | Distribution of landscape-level land-use variables.
Data distribution of the cover of arable fields, grassland and forest within
1,000 m surrounding each of the 150 grassland and 30 forest sites for each
Extended Data Fig. 6 | Correlations among weather and among land-use
variables. a, b, Coefficients of pairwise correlations and PCAs for weather 
variables (a) and land-use variables (b). Temperature-related data are based on 
observed air temperature by weather stations at each site. Precipitation is 
derived from gauge-corrected radar observations (RADOLAN, Deutscher 
Wetterdienst). For each site and year, we calculated mean temperature (T mean),
number of frost days (daily minimum temperature <0 °C; n frost), number of
warm days (daily mean temperature >20 °C; n warm days) and precipitation sum
in mm (precipitation) for three different periods: winter (November of the
previous year to February; win), growing period (March to October; grow) and
year (November of the previous year to October; year). The number of
independent observations for weather variables was n=1,406 (grasslands) or 
266 (forests). Land-use variables include local land-use intensity (local LU) and 
cover of arable fields (A), grassland (G) and forest (F) at different spatial scales 
(250, 500, 1,000, 1,500 and 2,000 m). The number of independent observations 
for land-use variables equalled the number of sites; n=150 (grasslands) or n=30 
(forests). On the basis of correlations and PCA results, we chose mean winter 
temperature and precipitation during the growing period, as well as cover of 
arable fields and cover of grassland, as ecologically meaningful and the least- 
correlated explanatory variables for modelling arthropod data. 
Extended Data Fig. 7 | Temporal patterns in weather conditions. Temporal
patterns of the sum of precipitation during the growing period (March to
October) and mean winter temperature (November of the previous year to 
February) for 150 grassland and 30 forest sites (n=1,406 (grassland) or 266 
(forest) independent observations). Boxes represent data within the 25th and 
75th percentile, black lines show medians and whiskers show 1.5x the 
interquartile range. A linear mixed model for each response variable, with year
as a fixed effect and the site nested in the region as a random effect, indicates that
winter temperature increased (grassland, z = 10.90, P < 0.001; forest, z = 8.24,
P < 0.001) and precipitation during the growing period decreased during our
study period (grassland, z = −6.53, P < 0.001; forest, z = −8.44, P < 0.001). We are
currently not able to quantify whether and how much the observed trends in 
arthropod numbers were affected by changes in climatic conditions 
(Supplementary Information section 2). 
Extended Data Fig. 8 | Results from multiscale analysis. Mean and s.d. of 
Pearson’s coefficients of correlation between arthropod numbers (biomass, 
abundance and number of species) and landscape-level land-use variables 
(cover of arable fields and cover of grassland) for radii of 250-2,000 m around 
150 grassland sites and 30 forest sites. Only data from a random subset of sites 
with non-overlapping buffers at the 2,000-m scale were used. The randomized 
subsampling of sites with non-overlapping buffers and the calculation of 
correlations was repeated 100 times (median number of sites per subsample was 
n=18 (grassland) or 17 (forest)). The 1,000-m scale was used for modelling 
arthropod numbers for both grassland and forests because (i) the correlation 
coefficients appeared to plateau at this scale in grasslands, (ii) the range of 
landscape-level land-use variables at small spatial scales in forests was small and 
(iii) buffers of neighbouring plots overlapped more extensively at higher spatial 
scales. 
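The subsampling procedure described in this legend can be sketched as follows. This is a minimal illustration under stated assumptions rather than the authors' code (the analyses were run in R): it assumes that two circular buffers overlap when their centres are closer than twice the radius, and that a random subset is drawn greedily from a shuffled list of sites. The field names (`xy`, `biomass`, `arable_cover`) and function names are hypothetical.

```python
import math
import random
from statistics import mean, stdev


def non_overlapping_subset(sites, radius, rng):
    """Greedy random subset whose circular buffers do not overlap.

    Assumption: buffers of equal radius overlap when the distance
    between site centres is smaller than 2 * radius.
    """
    order = list(sites)
    rng.shuffle(order)
    chosen = []
    for site in order:
        if all(math.dist(site["xy"], kept["xy"]) >= 2 * radius for kept in chosen):
            chosen.append(site)
    return chosen


def pearson(xs, ys):
    """Pearson's correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def subsampled_correlation(sites, response, predictor,
                           radius=2000.0, repeats=100, seed=0):
    """Mean and s.d. of Pearson's r over repeated random subsamples."""
    rng = random.Random(seed)
    coefficients = []
    for _ in range(repeats):
        subset = non_overlapping_subset(sites, radius, rng)
        if len(subset) < 3:
            continue  # too few sites for a meaningful correlation
        xs = [s[predictor] for s in subset]
        ys = [s[response] for s in subset]
        coefficients.append(pearson(xs, ys))
    if len(coefficients) < 2:
        raise ValueError("not enough valid subsamples")
    return mean(coefficients), stdev(coefficients)
```

Calling `subsampled_correlation(sites, "biomass", "arable_cover")` once per buffer radius would give one mean and s.d. per spatial scale, which is the kind of summary the legend reports.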
Extended Data Table 1 | Details on arthropod numbers 


                                    | Grassland | Forest30 | Forest140 | Sum 
Abundance 
  Identified and unidentified taxa  | 864,548   | 80,624   | 117,731   | 1,032,279 
  Unidentified taxa                 | 685,738   | NA       | NA        | 685,738 
  Identified taxa                   | 178,810   | 80,624   | 117,731   | 346,541 
Species number                      | 1,309     | 1,582    | 1,634     | 2,675 
Biomass [g]                         | 5,637     | 2,676    | 4,199     | 11,642 
Weak disperser 
  Abundance                         | 29,979    | 5,744    | 6,580     | 
  Species number                    | 209       | 136      | 148       | 
Strong disperser 
  Abundance                         | 133,710   | 74,492   | 110,794   | 
  Species number                    | 946       | 1,375    | 1,418     | 
Carnivores 
  Abundance                         | 10,503    | 9,543    | 12,611    | 
  Species number                    | 344       | 531      | 517       | 
Omnivores 
  Abundance                         | 3,849     | 21,878   | 36,900    | 
  Species number                    | 179       | 354      | 382       | 
Herbivores 
  Abundance                         | 163,054   | 20,016   | 33,252    | 
  Species number                    | 720       | 295      | 341       | 
Myceto-/detritivores 
  Abundance                         | 1,403     | 29,158   | 34,964    | 
  Species number                    | 64        | 395      | 394       | 


Total biomass, number of individual arthropods and number of arthropod species from 
150 grassland and 30 or 140 forest sites. Data are available for each year from 2008 to 2017 
for all 150 grassland sites, and from 2008 to 2016 for 30 forest sites. In addition, data from 
2008, 2011 and 2014 are available for 140 forest sites (including the 30 sites with annual data). 
Information regarding the abundance of arthropod taxa that were not identified to the species 
level was collected only in grasslands and not in forests. Classification as a weak or strong 
disperser was based on morphological and behavioural characteristics (Methods). Owing to 
missing information, not all species could be assigned to a dispersal or trophic group. 
For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection No software was used to collect data. 


Data analysis Data analyses were conducted using R version 3.5.1, including the packages vegan (function decostand), lme4 (functions glmer and lmer), 
reshape2 (function dcast), BiodiversityR (function rankabundance), stats (function prcomp), iNEXT and ggplot2 for graphics. 
Climate data were processed using the software TubeDB (https://environmentalinformatics-marburg.github.io/tubedb). 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


All data used for analyses are publicly available at the Biodiversity Exploratories Information System (http://doi.org/10.17616/R32P9Q): 
https://www.bexis.uni-jena.de/PublicData/PublicDataSet.aspx?DatasetId=25786. Raw data are publicly available from the same repository (IDs: 21969, 22007, 22008, 19686, 20366) or will 
become publicly available after an embargo period of five years from the end of data assembly to give data owners and collectors time to perform their analysis. 
these points even when the disclosure is negative. 


To disentangle local and landscape-level effects of land use on the temporal trends of arthropod communities in grasslands and 
forests, we used our arthropod data from the Biodiversity Exploratories research program, including more than 1 million individuals 
and 2,700 arthropod species. Arthropods were collected annually during the growing period from 2008 to 2017 by standardized 
sampling at 150 grassland plots and from 2008 to 2016 at 30 forest plots. An additional 118 forest plots were sampled in 2008, 2011 
and 2014 to test the overall trend across a larger number of plots. Both grassland and forest plots cover gradients in local land-use 
intensity. Land-use intensity was quantified in the form of compound indices based on grazing, mowing and fertilization intensity in 
grasslands, and on recent biomass removal, the proportion of non-native tree species and deadwood origin in forests. To analyze 
landscape-level effects, we quantified the cover of arable fields, grassland and forest within a 2 km radius around each sampling plot. 
We modelled temporal trends in biomass, abundance and species number of arthropods and of different dispersal and trophic guilds 
separately for grasslands and forests, and tested for effects of local and landscape-scale land-use intensity on these trends, 
accounting for weather conditions and different spatial scales. 


The sample unit is the arthropod community of a grassland or forest plot within a given year. It is characterized by measures of 
biomass, abundance and species number. A sample unit is considered to represent arthropod populations at our study sites which 
measured 50 m × 50 m in grasslands and 100 m × 100 m in forests. Data from different months and traps per plot were pooled per plot 
and year. 


Standardized sweep-net sampling along three 50-m transects was conducted in grasslands in June and August. These periods represent 
the start and end of the peak season for arthropods in Central Europe. Sweep-netting is the most efficient method for sampling arthropods in 
grassland habitats. 

In forests, we used two flight-interception traps per plot, which capture a broad range of flying arthropods. Traps were operated during 
the complete growing season. 

All samples were sorted to order level in the lab and all groups for which taxonomists were available were identified to species level. 


No statistical methods were used to predetermine sample size. 


Samples were collected in the field and sorted in the lab by trained technicians. Identification was done by expert taxonomists. All 
people involved are listed in the acknowledgments section. 


Grasslands: annual data collection at all 150 plots in June and August 2008 to 2017 (peak season of arthropods in our study regions) 
Forests: annual data collection from 2008 to 2016 (April to October) at 30 plots (full growing period represented); in addition, 118 
plots were sampled (April to October) in 2008, 2011 and 2014 to test for a larger number of plots whether trends are consistent with 
results based on annual data from the 30 plots described before 


No data were excluded from the analyses. 

Our data were collected as part of a monitoring programme running over several years, which cannot be repeated. 

Study plots were selected from ~3,000 candidate plots. Surveys of initial vegetation and land use were conducted on candidate plots 
by stratified random sampling to ensure that the selected plots covered the whole range of land-use intensity and to minimize 
confounding effects of spatial position or soil type. 


Investigators were not aware of the land-use intensity of the plot where they worked, but they could not otherwise be blinded during 
data collection and analyses, for example with respect to the year a sample came from. 


Field work, collection and transport 


Field conditions Grasslands: sweep-netting was conducted only when the vegetation was dry and wind speed was low. 
Forests: traps were operated in all weather conditions from April to October. 


Location Our data were collected in three German regions: (1) Schwäbische Alb in south-western Germany (420 km², 460-860 m above 
sea level (a.s.l.)); (2) Hainich-Dün in central Germany (1,560 km², 285-550 m a.s.l.); and (3) Schorfheide-Chorin in northeastern 
Germany (1,300 km², 3-140 m a.s.l.). 


Access and import/export Fieldwork permits were issued by the responsible state environmental offices of Baden-Württemberg (Regierungspräsidium 
Tübingen, file number 55-3/8852.15), Thüringen (Thüringer Landesverwaltungsamt, file number 13.4 64233/08-08SDH) and 
Brandenburg (Landesumweltamt Brandenburg, file number RO7/SOB-0907). 


Disturbance Activity of investigators was spatially limited to three 50-m transects for sweep-netting and short paths to access the 
flight-interception traps. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals No laboratory animals were involved in the study. 


Wild animals Arthropods of about 2,700 different species were collected in the field and killed on-site using CuSO4 solution for 
flight-interception traps and ethanol for sweep-netting. Identification of arthropods requires killing and transport to the lab, where 
microscopes can be used. 


Field-collected samples Samples were stored in 93% ethanol at 7°C except for short time periods during transport, sorting and identification. 


Ethics oversight It could not be ruled out that threatened or protected arthropod species would be collected and killed. Thus, permission was 
required from the authorities, which was granted for scientific reasons. These permits were issued by the responsible state 
environmental offices of Baden-Württemberg (Regierungspräsidium Tübingen, file number 55-3/8852.15), Thüringen (Thüringer 
Landesverwaltungsamt, file number 13.4 64233/08-08SDH) and Brandenburg (Landesumweltamt Brandenburg, file number 
RO7/SOB-0907). 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
The neural crest, an embryonic stem-cell population, is a vertebrate innovation that 
has been proposed to be a key component of the ‘new head’, which imbued vertebrates 
with predatory behaviour’”. Here, to investigate how the evolution of neural crest cells 
affected the vertebrate body plan, we examined the molecular circuits that control 
neural crest development along the anteroposterior axis of a jawless vertebrate, the sea 
lamprey. Gene expression analysis showed that the cranial subpopulation of the neural 
crest of the lamprey lacks most components of a transcriptional circuit that is specific 
to the cranial neural crest in amniotes and confers the ability to form craniofacial 
cartilage onto non-cranial neural crest subpopulations’. Consistent with this, 
hierarchical clustering analysis revealed that the transcriptional profile of the lamprey 
cranial neural crest is more similar to the trunk neural crest of amniotes. Notably, 
analysis of the cranial neural crest in little skate and zebrafish embryos demonstrated 
that the transcriptional circuit that is specific to the cranial neural crest emerged via the 
gradual addition of network components to the neural crest of gnathostomes, which 
subsequently became restricted to the cephalic region. Our results indicate that the 
ancestral neural crest at the base of the vertebrate lineage possessed a trunk-like 
identity. We propose that the emergence of the cranial neural crest, by progressive 
assembly of an axial-specific regulatory circuit, allowed the elaboration of the new head 
during vertebrate evolution. 


Gans and Northcutt’s ‘new head’ hypothesis proposed that the 
emergence of the vertebrate lineage was accompanied by the advent 
of the neural crest (NC), a stem-cell population that arises within the 
forming central nervous system (CNS) in all vertebrates!”. These 
cells subsequently leave the CNS, migrate to diverse locations and 
differentiate into many tissues, including peripheral ganglia and the 
craniofacial skeleton**. As vertebrates evolved, NC cells contributed 
to morphological novelties such as jaws, which enabled the expansion 
of vertebrates. 

It has been proposed that a pan-vertebrate NC gene regulatory net- 
work (GRN), which invokes sequential deployment of signalling and 
transcriptional events, underlies the formation of this cell type. The core 
of the NC GRN is largely conserved across vertebrates, including the sea 
lamprey Petromyzon marinus, a jawless (cyclostome) vertebrate, and 
has been primarily studied at cranial levels. However, key transcription 
factors such as Ets1 and Twist are deployed later in the lamprey GRN than 
in amniotes®”’, suggesting that there are regulatory differences between 
cyclostomes and gnathostomes. Furthermore, some NC derivatives are 
unique to gnathostomes, including jaws at cranial levels, a vagal-derived 
enteric nervous system, and sympathetic ganglia at trunk levels®”. This 
raises the possibility that network differences in axial regionalization 
of the neural crest may have contributed to the presence of these 
gnathostome cell types. 

In jawed vertebrates, the NC is subdivided along the body axis into 
cranial, vagal, and trunk subpopulations®. By contrast, lampreys lack 
an intermediate vagal population, suggesting that there are two major 
subpopulations: cranial and trunk®. It is unclear how axial identity in 
the lamprey is controlled at a molecular level. Avian embryos possess a 
‘cranial crest-specific’ NC GRN subcircuit that can drive the differentia- 
tion of trunk NC into ectomesenchymal derivatives’. In this subcircuit, 
the transcription factors Brn3c, Lhx5, and Dmbx1 are expressed at the 
neural plate border and, in turn, activate the expression of Sox8 in the 
premigratory cranial NC and Ets1 in the migratory cranial NC (Fig. 1a). 
By contrast, the pan-NC genes Tfap2b and Sox10 are expressed all along 
the body axis”. 


Lamprey lacks full cranial NC subcircuit 


We investigated whether this cranial subcircuit is a general feature of 
vertebrates by examining whether lampreys possess a homologous 
spatiotemporal regulatory state. Taking a candidate approach, we analysed 
the expression of cranial crest subcircuit orthologues in lamprey 
embryos at different developmental stages. In contrast to amniotes, in 
lampreys the premigratory and migratory NC did not appear to express 
Brn3, Lhx5, Dmbx1, or Ets1 (Fig. 1b). The lack of most cranial-specific 
regulatory factors suggests a high degree of divergence between the 
early regulatory states of the lamprey and amniote NC. By contrast, SoxE1 
and Tfap2 were robustly expressed in premigratory and migratory NC 
along the entire lamprey body axis (Fig. 1b, Extended Data Fig. 1a–d). 
No SoxE family member was restricted in expression to the cranial NC, 
as Sox8 is in amniotes. Of note, lamprey SoxE transcription factors are 
homologous to gnathostome Sox8, Sox9 and Sox10, and SoxE paralogue 
usage varies across gnathostomes” °. Consistent with the lack of 
restricted ‘cranial-specific’ expression, ectomesenchymal derivatives 
have been found at trunk levels in the lamprey dorsal fin’. 

How then did this regulatory subcircuit evolve? Notably, genes from 
the amniote cranial crest subcircuit are present in the lamprey genome 
but are expressed later in pharyngeal arches populated by NC cells 
(Fig. 1c, d, Extended Data Figs. 1e–l, 2). An intriguing possibility is that 
these genes were expressed only in late NC derivatives of early vertebrates 
and were gradually co-opted to earlier developmental stages in 
gnathostomes. According to this scenario, genes involved in NC differentiation 
in early vertebrates were co-opted to the specification 
program of gnathostomes at all axial levels. With subsequent regulatory 
modifications, they became cranially restricted, possibly endowing the 
cranial NC with novel morphogenetic features while the trunk NC lost 
the ability to make cranial-like derivatives*”. 

Fig. 1 | Lamprey cranial NC lacks most components of a chick ‘cranial crest 
circuit’. a, Biotapestry model of cranial NC-specific gene regulatory circuit that 
drives skeletal differentiation in amniotes. b, Expression of lamprey orthologues 
of amniote cranial NC-specific genes at T21 and T23. Blue arrows, expression in 
the cranial neural crest (CNC); red arrows, expression in the trunk neural crest 
(TNC). c, Late expression of cranial NC-specific orthologues in pharyngeal arch 
NC derivatives (black arrow). d, Biotapestry model of the lamprey circuit with 
the addition of late module expression of markers in pharyngeal arch NC 
derivatives. TGG, trigeminal ganglia. Scale bars, 250 μm. Reproducible on n=5 
embryos per time point for n>10 experiments. 


Skates add Ets1 to cranial NC subcircuit 


To explore the transcriptional landscape in a basal jawed vertebrate, we 
examined candidate elements of the cranial NC subcircuit in the little 
skate Leucoraja erinacea, a member of a chondrichthyan gnathostome 
outgroup to the bony fishes. Sox9 and Sox10 of the SoxE genes, as well as 
Tfap2b and Ets1, were expressed at all axial levels and not restricted to the 
cranial crest (Fig. 2a, Extended Data Fig. 3). Because Ets1 appears in the 
little skate migratory NC, as in other gnathostomes, we conclude that this 
early node was a novelty acquired by the NC GRN before the divergence 
of cartilaginous and bony fishes (Fig. 2a, b, Extended Data Fig. 3). Later, 
after NC cells had migrated to and populated the pharyngeal arches, 
SoxE, Tfap2b, and Ets1 were expressed within the arches (Extended Data 
Fig. 4). Trunk NC cells in the little skate give rise to ectomesenchymal 
dermal denticles (‘cranial-like’ derivatives), consistent with our observation 
that cranial subcircuit genes in the little skate are not restricted to 
the head but can drive the differentiation of skeletogenic derivatives 
in the trunk”. In the fossil record, many stem-gnathostomes possessed 
extensive dermal armour, which has been retained, albeit with the dental 
component reduced and modified, within the gnathostome crown group 
(for example, the dermal denticles of chondrichthyans and the dentinous 
scales of Polypterus and coelacanth). Thus, dental tissues in the 
postcranial dermal skeleton appear to be ancestral for gnathostomes. 

Fig. 2 | Nodes of an early cranial NC-specific circuit were acquired in the 
cranial NC progressively throughout gnathostome evolution. a, Expression 
of cranial NC-specific orthologues in the little skate at stages (S) 17 and 18. Blue 
arrow, expression of orthologues in the cranial NC; red arrows, expression of 
orthologues in the trunk NC. b, Biotapestry model of the skate circuit with the 
addition of a novel node, Ets1. c, Expression of cranial NC-specific orthologues in 
the zebrafish at 5-9 somite stage (ss) and 14 ss. d, Biotapestry model of the 
zebrafish circuit with the addition of novel early nodes, lhx5 and dmbx1. FB/MB, 
forebrain/midbrain; St, stomodeum. Scale bars, 250 μm. For skates, in situ 
hybridizations were reproducible on n=2 embryos for n>2 experiments. For 
zebrafish, in situ hybridizations were reproducible on n≥5 embryos per time 
point for n=10 experiments. 

Fig. 3 | Tissue-specific RNA sequencing comparisons between lamprey 
and chicken reveal that the ancestral NC had a more trunk-like identity. 
a, Volcano plot showing lamprey differential enrichments of cranial (blue) and 
trunk (red) genes by population RNA sequencing (RNA-seq; 100 embryos were 
dissected for each of n=2 biological replicates; adjusted P<0.05). b, Volcano 
plot showing enrichment of genes in the cranial (blue) versus the trunk (red) NC 
in chicken (215 heads and 55 trunks were dissected and prepared for FACS for each 
of n=3 biological replicates; adjusted P<0.05). c, Hierarchical clustering 
analysis of all RNA-seq libraries focused on the NC GRN reveals similarities and 
differences between axial levels among species. 

Fig. 4 | Model of the evolution of NC axial levels during vertebrate evolution. 
a, Our data suggest that the ancestral NC was a more uniform population of cells 
along the body axis that underwent gradual regulatory modifications during 
gnathostome evolution. b, Progressive restriction of the ‘cranial circuit genes’ to 
the cranial axial level led to axial specialization of the NC regulatory program. 


Further elaboration of the NC subcircuit 


Notably, in the teleost Danio rerio (zebrafish), lhx5 and dmbx1 are 
expressed in the early cranial circuit but not in later pharyngeal arch 
derivatives (Fig. 2c, d, Extended Data Figs. 5, 6). In addition, sox8b, sox10, 
tfap2a, and ets1, but not brn3c (also known as pou4f3) are expressed in 
premigratory and migratory crest at all axial levels (Fig. 2c, Extended 
Data Fig. 5). Rather than being restricted to the cranial NC, many of these 
factors are also expressed in the zebrafish trunk, raising the possibility 
that the resolution of axial level potential may have arisen within 
sarcopterygians. Furthermore, in situ analysis of pharyngeal arch derivatives 
in both little skate and zebrafish lends support to temporal shifts of 
cranial-specific regulatory nodes from later NC derivatives to an early 
specification program (Extended Data Figs. 4, 6). With loss of nodes 
from late derivatives and addition to an earlier program, this suggests 
that regulatory modifications arose gradually throughout gnathostome 
evolution (Extended Data Fig. 1m). 


Lamprey cranial NC is more trunk-like 


Our candidate gene approach suggests that extensive changes occurred 
in the NC regulatory state between jawless and jawed vertebrates. To 
investigate further, we conducted a comparative transcriptome analysis 
of cranial and trunk NC subpopulations in lamprey and chicken (Fig. 3). 
We obtained premigratory lamprey NC by micro-dissecting segments 
of cranial and trunk dorsal neural tubes at Tahara stages T21 and T23.5. 
For chick, we isolated premigratory NC populations using enhancers 
that drive eGFP expression in cranial or trunk neural crest populations 
for fluorescence-activated cell sorting (FACS) at Hamburger-Hamilton 
stages HH9+ and HH18, respectively. After cDNA library preparation 
and sequencing, differential expression analysis showed that far fewer 
genes were significantly enriched in lamprey cranial NC versus trunk 
NC, compared with chick cranial NC versus trunk NC (1,233 genes in 
lamprey compared with 2,794 in chicken; Fig. 3a, b). 

To better understand how each library correlated to the others 
in an unbiased fashion, we mapped each to a common reference 
transcriptome, created by aligning proteomes using BLAT” and compiling 
matching sequences as a consensus alignment between species for 
Bowtie mapping (Supplementary Table 1). We next performed hierarchical 
clustering analysis of all known NC GRN genes (Fig. 3c). Consistent 
with our previous in situ hybridization analysis (Fig. 1b), Tfap2 (known 
as TFAP2A in chicken) was enriched at all axial levels in both chicken and 
lamprey, SOX8 was enriched in chicken cranial NC but in both cranial 
and trunk lamprey populations, and DMBX1 and ETS1 were enriched in 
chick but not lamprey cranial datasets (Fig. 3c). 
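To illustrate what hierarchical clustering of expression libraries does, the following minimal Python sketch groups libraries by the similarity of their expression profiles. The text does not state which distance measure or linkage rule was used, so the choice here (distance = 1 − Pearson's r, average linkage) is an assumption, and the function names and input format are hypothetical.

```python
import math
from itertools import combinations
from statistics import mean


def pearson(xs, ys):
    """Pearson's correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def average_linkage(profiles):
    """Agglomerative clustering of libraries.

    profiles: dict mapping library name -> list of expression values
    (same genes, same order, in every library).
    Assumptions: distance = 1 - Pearson's r; average linkage.
    Returns the merges as (cluster_a, cluster_b, distance) tuples.
    """
    dist = {
        frozenset((a, b)): 1.0 - pearson(profiles[a], profiles[b])
        for a, b in combinations(profiles, 2)
    }

    def cluster_distance(c1, c2):
        # Average of all pairwise distances between members of the clusters
        pairs = [dist[frozenset((a, b))] for a in c1 for b in c2]
        return sum(pairs) / len(pairs)

    clusters = [(name,) for name in profiles]
    merges = []
    while len(clusters) > 1:
        c1, c2 = min(combinations(clusters, 2),
                     key=lambda pair: cluster_distance(*pair))
        merges.append((c1, c2, cluster_distance(c1, c2)))
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 + c2]
    return merges
```

The order of the merges corresponds to the branching of a dendrogram: libraries joined first (at the smallest distance) are the most similar in their expression of the selected genes.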

Notably, we found that lamprey cranial NC populations correlated 
more closely to chicken trunk than lamprey trunk libraries, suggesting 
that basal NC was ‘trunk-like’ in its regulatory program (Fig. 3c). These 
results suggest that cyclostomes possess a simpler and more trunk-like 
cranial crest GRN, with potentially important implications for the evolution 
of NC subpopulations (Fig. 4a). Accordingly, we speculate that 
the ancestral neural crest may have been relatively homogeneous and 
trunk-like. Throughout evolution of the vertebrate lineage, we propose 
that key transcription factors were progressively co-opted into an early, 
cranially restricted circuit, whereas some features, such as skeletogenic 
potential, were lost from the trunk. 


Conclusions 


The differences in expression of axial level-specific genes contrast with 
the deep conservation of the pan-NC program®"*. Transcription factors 
such as SoxE genes, Tfap2, and Id may be the rudiments of a larger, more 
complex cranial crest GRN that was expanded during early vertebrate 
evolution with the incorporation of genes such as Dmbx1, Brn3c, Ets1, and 
Lhx5. Consistent with these findings, the basal chordate amphioxus lacks 
expression at the neural plate border of genes such as Dmbx, Brn3, and 
Ets, as well as core NC genes such as SoxE, FoxD, Tfap2, and Id, although 
these genes are expressed in other tissues” ~”. Our observations also 
show that some of these ‘novel’ genes are expressed at later stages of NC 
formation in lamprey, consistent with the possibility that elaboration of 
the GRN might have involved the co-option of parts of differentiation 
programs to earlier portions of the network, perhaps by acquisition of 
new regulatory elements responsible for their heterochronic shift”’. 
Thus, the pan-NC program was probably the ancestral molecular recipe 
to make the NC, with the subsequent elaboration of axial level-specific 
regulatory programs conferring important differences in developmental 
potential along the body axis. Given that many key NC derivatives are 
gnathostome innovations, we hypothesize these derivatives may have 
been gained as the result of gene regulatory differences associated with 
axial level-specific regulatory programs. 

Together, our results suggest the following scenario to explain the 
evolution of NC subpopulations (Fig. 4). We suggest that the NC of early 
vertebrates was uniform and similar to amniote trunk populations, 
and that the division of the NC into cranial and trunk subpopulations 
occurred early in vertebrate evolution (Fig. 4a). Consistent with evolu- 
tionary expansion of NC cells in the vertebrate lineage, our molecular 
analysis of the cranial NC reveals unexpected differences between lam- 
preys and their gnathostome counterparts (Fig. 4b). Given that the Hox 
code was already linked to segmentation of the CNS in basal vertebrates, 
posteriorizing influences of Hox genes and other factors may be suf- 
ficient to account for the subtle transcriptional differences observed 
between these two populations. We cannot rule out the possibility 
that cyclostomes lost NC subpopulations during the course of evolution. 
However, the relative scarcity of cranial NC-specific subcircuit factors in 
the lamprey cranial crest might suggest that the gnathostome cranial NC 
GRN has undergone extensive elaboration from a regulatory standpoint. 
Thus, we propose that regionalization of the NC, with both emergence 
of new subpopulations and expansion of the cranial crest GRN, was a 
key element for driving the evolution and expansion of gnathostomes 
during vertebrate evolution. 
What does this mean for the new head hypothesis? We posit that the 
NC component of the new head, rather than arising in toto at the base of 
the vertebrate lineage, underwent continued regulatory modifications, 
evolving gradually during the course of vertebrate evolution. Our data 
suggest that early vertebrates possessed a relatively simple NC that initially 
arose as a fairly uniform population along the body axis and lacked 
region-restricted regulatory programming. During gnathostome evolu- 
tion, the cranial NC appears to have gained regulatory complexity that 
modulated its differentiation capacity, gaining some individual cell fates 
while restricting others. We propose that co-option of distinct genes into 
a cranial NC-specific module enabled this progressive specialization of 
NC regulatory programs, leading to the specific axial subpopulations 
and morphological novelties of the gnathostome body plan. 
Methods 


Animal husbandry and embryo collection 

Adult sea lamprey were obtained from the US Fish and Wildlife Service 
and Department of the Interior. Embryos were cultured according to 
previously published protocols and staged according to Tahara staging 
methods””*. All lamprey embryology work was completed in compliance 
with California Institute of Technology Institutional Animal Care 
and Use Committee (IACUC) protocol 1436. Skate eggs were obtained 
by the Marine Biological Laboratory (MBL) Marine Resource Center, and 
embryos were cultured as previously described”. All skate embryology 
work was compliant with animal protocols approved by the IACUC at 
the MBL. Adult zebrafish were maintained in the Beckman Institute 
Fish Facility at Caltech, and all animal and embryo work was compliant 
under approved IACUC protocol 1346. Fertilized chicken eggs were 
obtained from a local farm in Sylmar, CA. No statistical methods were 
used to predetermine sample size for analyses. For in situ hybridization, 
embryos were pooled from different breeding pairs (fish), brooding 
stocks (skates), or embryo batches (lamprey) to ensure replication of 
results in multiple fixed collections. 


Cloning of lamprey, skate, and zebrafish orthologues 

RNA was extracted from desired embryo stages using the RNAque- 
ous-Micro Kit (Thermo Fisher Scientific), and cDNA was synthe- 
sized using a SuperScript III Reverse Transcriptase Kit (Invitrogen). 
The following gene-specific primers were used to amplify probe 
template sequences (accession numbers in parentheses): 
PmTfap2 (MN410935): F: 5’-GCATCGCGACAGTTGTTTGCTG-3’; R: 5’-GATGCTGTGGTGCCCTAATCC-3’; 
PmSoxE2 (MN410934): F: 5’-CGAGTCACGTGGATCTGCTGC-3’; R: 5’-CCGTCCAGCACTTGACTCACG-3’; 
PmSoxE1 (MN410933): F: 5’-CGGGCTGAGTCATTACTCGCATCG-3’; R: 5’-CTCTCGTCGCTGTCGGAAGC-3’; 
PmEts1a (SIMRbase: PMZ-0040201): F: 5’-GGACCTTCAAGGAGTACATGAGC-3’; R: 5’-GAGAGCGGTACTCGTGGAAAGTC-3’; 
PmDmbx1 (Ensembl: ENSPMAG00000008114): F: 5’-GCGCATGAATACCGGCCGTCG-3’; R: 5’-TTGCTTTGATGCTGTTACAAGG-3’; 
PmLhx5 (MN410936): F: 5’-CGTGCGTTCGTGACCCCATC-3’; R: 5’-GAGGCCAGGTAGTCCTCCTTG-3’; 
PmBrn3 (SIMRbase: PMZ-0005302): F: 5’-CGAGTCTCCTTAACGCGTTAGCTC-3’; R: 5’-GCTCTGGTGGGAGACAATATCCACG-3’; 
LeTfap2b (MN410937): F: 5’-TCCCACTTCCACAGAAGAAT-3’; R: 5’-TCCTTGTCTCCAGTTTTGGTG-3’; 
LeSox10 (MN410938): F: 5’-ACCCCCGTTCTGTGTGTCT-3’; R: 5’-GGCAGGTACTGGTCGAACTC-3’; 
LeSox9 (MN410939): F: 5’-CCCAGCCACTACAATGAGCAG-3’; R: 5’-CCGTACGGCATCAGCAAATG-3’; 
LeSox8 (MN410944): F: 5’-CAACTCCGCCCACCACTCC-3’; R: 5’-TGGCCTAGTCAGGGTTGTGTAG-3’; 
LeEts1 (MN410940): F: 5’-TTCAGCCTGAAGAACGTGGAC-3’; R: 5’-GCAAGACTTGTCCGTCAGGAG-3’; 
LeDmbx1 (MN410941): F: 5’-CAATCAACACGACAGGGACA-3’; R: 5’-GTAAGCTGTCAAGCCCCAGA-3’; 
LeLhx5 (MN410942): F: 5’-TCATCGACGAAAACAAATTTGTGTG-3’; R: 5’-TGAATAACCCGCATGTTGAGGC-3’; 
LeBrn3c (MN410943): F: 5’-CTTCAAGCCGGACATCACCTAC-3’; R: 5’-TAGATCCCTGCTTGTTCCTGC-3’; 
Drtfap2a (NM_176859): F: 5’-GTCACGGCATTGATACTGGACTC-3’; R: 5’-TCATTGGCACACTGCTTTACTGAT-3’; 
Drsox10 (NM_131875): F: 5’-GTGAAACACACTTCCCTGGGGATAC-3’; R: 5’-GTGGAGACATGTGTGTATGGCGTC-3’; 
Drsox8b (NM_001025465): F: 5’-ATGAGCGAGGAGCGGGAAAAGTG-3’; R: 5’-GGGTCTGGACAGAGTGGTGTAGAC-3’; 
Drets1 (NM_001017558): F: 5’-CAGACAGCGGATCTTGTTGAGGGA-3’; R: 5’-CAGTCCAGCTGATGAAGGACTGG-3’; 
Drdmbx1a (NM_152977): F: 5’-CGTGCCAGTCCTACTATCAGTCTC-3’; R: 5’-CTGCTGTGTAGTGCATGCAACC-3’; 
Drlhx5 (NM_131218): F: 5’-CACGGACATGATATCCCATGCAGAC-3’; R: 5’-CTAGCTCACTTCTGACCATCAGATGC-3’; 
Drbrn3c (NM_131278): F: 5’-ATGATGACCATGAACGGCAAGC-3’; R: 5’-GTGCACTGCTGAATACTTCATCC-3’. 


Phylogenetic analysis of Dmbx proteins 
Candidate Dmbx sequences were assembled as an ungapped FASTA 
file, imported into the TCoffee server (http://tcoffee.crg.cat) and 
processed with default parameters in Expresso 
(http://tcoffee.crg.cat/apps/tcoffee/do:expresso) to produce a protein 
alignment. The TCoffee FASTA alignment was imported into MegAlign Pro 
(DNAstar ver 15.0.0) and ambiguous regions with poor alignment scores 
were removed, leaving only large, contiguous regions of well-aligned 
sequence. This alignment of 218 amino acid residues was exported in 
nexus format. The start of the P. marinus sequence is missing, and so 
residues were recoded from gaps to indicate missing sequence. The file 
was modified to include a MrBayes block, with aamodelpr = mixed, 
stopval = 0.01, ngen = 200000, and burninfrac = 0.25. The file was 
executed within MrBayes3.2.1, and the resulting consensus tree was 
visualized in FigTree v.1.4.2 to show posterior probabilities (as 
percentages) at corresponding branch labels. Image output files from 
MegAlign Pro and FigTree v.1.4.2 were combined in Adobe Illustrator 
2019 (Adobe Creative Suite 2019) (Extended Data Fig. 2). 
NCBI accession numbers or Ensembl identifiers for Dmbx sequences 
used in phylogenetic analyses are as follows: XP_003725762.1 
(S. purpuratus); NP_001161526.1 (S. kowalevskii); AAT66431.1 (B. floridae); 
Ensembl ENSPMAG00000008114 (P. marinus); XP_020369662.1 (R. typus); 
AAI34895.1 (Dmbx1a, D. rerio); NP_001017625.1 (Dmbx1b, D. rerio); 
XP_017949066.1 (X. tropicalis); XP_001234036.2 (G. gallus); NP_671725.1 
(H. sapiens). 
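
The MrBayes settings listed above can be written out as a command block appended to the exported nexus file. The following is a minimal sketch, not the authors' actual file: only `aamodelpr`, `stopval`, `ngen` and `burninfrac` come from the text, whereas `stoprule=yes`, `relburnin=yes`, the `set autoclose=yes nowarn=yes` line and the helper names `mrbayes_block` and `append_block` are assumptions made for illustration.

```python
def mrbayes_block(ngen=200_000, stopval=0.01, burninfrac=0.25, aamodelpr="mixed"):
    """Build a MrBayes command block as a string."""
    commands = [
        # Assumption: run non-interactively; not stated in the Methods.
        "set autoclose=yes nowarn=yes",
        # Mixed amino-acid model prior (value taken from the Methods).
        f"prset aamodelpr={aamodelpr}",
        # ngen and stopval are from the Methods; stoprule=yes is an assumption,
        # because stopval only takes effect when the stop rule is enabled.
        f"mcmcp ngen={ngen} stoprule=yes stopval={stopval}",
        "mcmc",
        # burninfrac is from the Methods; relburnin=yes is an assumption.
        f"sumt relburnin=yes burninfrac={burninfrac}",
    ]
    body = "\n".join(f"  {cmd};" for cmd in commands)
    return f"begin mrbayes;\n{body}\nend;\n"


def append_block(nexus_text, block):
    """Append the block to nexus-format text, separated by one blank line."""
    return nexus_text.rstrip("\n") + "\n\n" + block


if __name__ == "__main__":
    print(mrbayes_block())
```

Generating the block from named parameters keeps the four reported values in one place, so a re-run with a longer chain only needs a different `ngen`.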


In situ hybridization, sectioning, imaging, and biotapestry 
modelling 

Whole-mount in situ hybridization was performed using previously 
published protocols. Cryosections of lamprey, skate, or zebrafish embryo 
in situs were sectioned at 18 µm with a Microm HM550 cryostat. In situ 
analysis of S25 skate embryo sections was performed using paraffin 
sections as follows: after fixation, embryos were embedded in paraffin 
and sections were prepared at 5-µm thickness on a Zeiss microtome. 
After paraffin removal with histosol, sections were hybridized with 
1 ng/µl antisense digoxygenin-labelled probes overnight at 70 °C in a 
humidifying chamber. After hybridization, sections were washed with 
50% formamide/50% 1x SSCT buffer followed by washes with MABT 
and a blocking step in 1% Roche blocking reagent. Sections were then 
incubated overnight at room temperature with a 1:2,000 dilution of 
anti-DIG-alkaline phosphatase antibody (Roche). After several washes 
with MABT, chromogenic colour was developed using NBT/BCIP 
precipitation (Roche). Imaging was performed on a Zeiss AxioImager.M2 
equipped with an Apotome.2. Gene network models were assembled 
using BioTapestry. 


Chicken embryo electroporation, dissociation, and cell sorting 
Cranial and trunk NC cells were labelled using the previously published 
NC enhancers FoxD3-NC1.1 and FoxD3-NC2, respectively. To isolate 
cranial NC cells, stage HH4 embryos were bilaterally electroporated 
with FoxD3-NC1.1>eGFP and cultured ex ovo until stage HH9+. For 
each biological replicate, at least 15 embryo heads were dissected in 
Ringer’s solution and washed three times in chilled 1x PBS. For trunk NC 
cells, stage HH10 embryos were bilaterally electroporated with 
FoxD3-NC2>eGFP and cultured in ovo until stage HH18. Five embryo trunks 
spanning the length of five somites were dissected in Ringer’s solution 
and washed three times in chilled 1x PBS. The tissues were dissociated 
in Accumax (Innovative Cell Technologies, Inc.) for 15 min at 37 °C and 
GFP+ cells were collected using FACS. 


Library preparation and sequencing 

Chicken libraries were prepared using the SMART-Seq Ultra Low Input 
RNA Kit (Takara) according to the manufacturer’s protocol. For lamprey 
embryos, tissue was dissected from the cranial dorsal neural 
tubes of n = 100 T21 and trunk neural tubes of n = 100 T23.5 embryos. 
Total RNA was extracted using the RNAqueous kit (Ambion). RNA-seq 
was performed at the Millard and Muriel Jacobs Genetics and Genomics 
Laboratory (California Institute of Technology, Pasadena, CA) at 
50 million single-end reads on two biological replicates for both the 
T21 cranial and T23.5 trunk neural tube samples. Sequencing libraries 
were built according to Illumina Standard Protocols. SR50 sequencing 
was performed on a HiSeq Illumina machine. 


Statistical analysis of lamprey and chicken axial population 
RNA-seq 

To identify orthologous genes between lamprey and chicken, the lamprey 
proteome obtained from SIMRbase was aligned to the chicken 
proteome using the BLAT alignment software available on the UCSC 
genome browser. In brief, every lamprey protein sequence was queried 
locally against the chicken proteome, following which regions with 
the longest alignment were matched to the respective chicken genes. 
Using this alignment-based approach, proteins with an alignment 
percentage score between 52 and 100 (see Supplementary Table 1 for exact 
scores for each orthologue) were identified as orthologues, and their 
respective cDNA sequences were obtained from the chicken and lamprey 
databases. Chicken cranial and trunk libraries were aligned to the chicken 
sequences, while the lamprey cranial and trunk libraries were aligned 
to the lamprey sequences using Bowtie2. Transcript counts were 
calculated using HTSeq-Count and differential gene expression analysis 
was performed using DESeq2. Using chicken gene annotations as a 
reference, we added the transcript counts for duplicated orthologues 
found in the lamprey genome to calculate an ‘aggregated’ transcript 
count for each gene. These aggregated transcript counts were then 
normalized using the formula: 


$$Z_i = \frac{T_i - \min(T)}{\max(T) - \min(T)}$$


where $Z_i$ is the normalized transcript count and $T_i$ is the absolute 
transcript count. A subset of genes previously identified as being part of the 
neural crest gene regulatory network was then isolated from the count 
matrix and plotted as a heatmap to obtain the gene expression matrix. 
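
The aggregation and normalization steps described above can be illustrated with a short sketch. It is not the authors' code (which is available on request), and it makes two assumptions that the text does not settle: the minimum and maximum are taken over all genes within one sample, and a constant sample is mapped to zeros. The function names, gene identifiers and counts are hypothetical placeholders.

```python
from collections import defaultdict


def aggregate_counts(lamprey_counts, lamprey_to_chicken):
    """Sum counts of lamprey genes that share the same chicken reference gene."""
    aggregated = defaultdict(float)
    for gene, count in lamprey_counts.items():
        reference = lamprey_to_chicken.get(gene)
        if reference is not None:  # genes without an orthologue are skipped
            aggregated[reference] += count
    return dict(aggregated)


def min_max_normalize(counts):
    """Rescale counts to [0, 1] using (T_i - min(T)) / (max(T) - min(T))."""
    lowest, highest = min(counts.values()), max(counts.values())
    if highest == lowest:
        # Assumption: a constant sample carries no contrast, so return zeros.
        return {gene: 0.0 for gene in counts}
    span = highest - lowest
    return {gene: (value - lowest) / span for gene, value in counts.items()}


# Hypothetical placeholder data: two lamprey duplicates map to one chicken gene.
lamprey_counts = {"lamprey_a": 120, "lamprey_b": 80, "lamprey_c": 50, "lamprey_d": 250}
lamprey_to_chicken = {
    "lamprey_a": "chicken_1",
    "lamprey_b": "chicken_1",
    "lamprey_c": "chicken_2",
    "lamprey_d": "chicken_3",
}

aggregated = aggregate_counts(lamprey_counts, lamprey_to_chicken)
normalized = min_max_normalize(aggregated)
```

Summing duplicates before rescaling means each chicken reference gene contributes a single row to the heatmap, which is what makes the two species directly comparable.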


Reporting summary 


Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 

All raw sequencing data for all RNA-seq libraries (Fig. 3) and merged 
reference transcriptomes are available online (NCBI BioProject# 
PRJNA497902). Sequences of in situ probe templates for Figs. 1b, c, 2a, c 
are available through GenBank accession codes (see Methods). 


Code availability 


Code used to analyse sequencing datasets is available from the 
corresponding author upon request. 
Extended Data Fig. 1 | Heterochronic shifts of cranial specific gene 
regulatory nodes from later neural crest derivatives to an early 
specification program throughout gnathostome evolution. a-d, Expression 
of lamprey orthologues of amniote cranial NC-specific genes at T21 (cranial) and 
T23 (trunk) in cross-section. e, Pharyngeal NC derivative expression in T26 
P. marinus frontal section (based on ref. 40). f-l, Cranial circuit orthologues are 
expressed in pharyngeal arch derivatives, with the exception of Brn3, which is 
expressed in the NC-derived cranial sensory ganglia in lamprey frontal 
sections. m, Gene expression matrix summarizing the heterochronic shift of 
cranial crest specific circuit nodes. nc, neural crest; nt, neural tube; 
n, notochord. Scale bars, 100 µm. Cryosections of in situ hybridizations were 
reproducible on n=5 embryos per time point for n>2 experiments. 
Extended Data Fig. 2 | P. marinus Dmbx is homologous to gnathostome 
Dmbx genes. a, Truncated alignment of Dmbx protein sequences. An alignment 
of full-length Dmbx protein sequences was assembled using TCoffee and 
contiguous regions tagged by the program as poorly or moderately well-aligned 
were removed, leaving 218 well-aligned residues. b, Bayesian consensus 
phylogenetic tree, with posterior probabilities shown at corresponding nodes. 
Extended Data Fig. 3 | Expression of cranial circuit genes in the neural crest 
of the little skate. a, Schematic of a stage 18 L. erinacea embryo with the NC 
illustrated as blue (cranial) and red (trunk). b-f, Cross-sections as depicted in a. 
Scale bar, 50 µm. Cryosections of in situ hybridizations were reproducible on 
n=2 embryos for n≥2 experiments. 
Extended Data Fig. 4 | Pharyngeal neural crest derivative expression of 
cranial circuit orthologues in stage 25 L. erinacea embryos. a, Schematic of 
head of L. erinacea embryo. Dashed box represents the region of the head for 
each embryo shown in b-i (left); purple dashed line depicts the location of the 
frontal section for b-i (right). b-f, Right, pharyngeal NC derivative expression of 
cranial circuit orthologues. g-i, Right, Dmbx1, Lhx5, and Brn3c are absent in 
pharyngeal arch derivatives at stage 25. b-i, Scale bars, 500 µm (left); 100 µm 
(right). In situ hybridizations were reproducible on n=2 embryos. 




Extended Data Fig. 5 | Expression of cranial circuit genes in the NC of the zebrafish. a, Schematic of a 14ss D. rerio embryo with the NC illustrated as blue (cranial) 
and red (trunk). b-h, Cross-sections as shown in a. Scale bars, 50 µm. In situ hybridizations were reproducible on n=10 embryos. 
Extended Data Fig. 6 | Expression of cranial circuit orthologues in 
pharyngeal NC derivatives of D. rerio embryos at 3 days post fertilization. 
a, Schematic of head of zebrafish embryo at 3 days post fertilization (dpf). 
Purple dashed line depicts the location of the frontal sections for the insets 
in e-h. e-h, Expression of cranial circuit orthologues in pharyngeal arches. 
b-d, Dmbx1, Lhx5, and Brn3c are absent from pharyngeal arch derivatives at 
3 dpf. Scale bar, 150 µm. In situ hybridizations were reproducible on n=10 
whole-mount and cryosectioned embryos. 
Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistical parameters 


When statistical analyses are reported, confirm that the following items are present in the relevant location (e.g. figure legend, table legend, main 
text, or Methods section). 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


An indication of whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistics including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND 
variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Clearly defined error bars 
State explicitly what error bars represent (e.g. SD, SE, CI) 


Our web collection on statistics for biologists may be useful. 


Software and code 


Policy information about availability of computer code 


Data collection Zeiss Zen, Illumina HiSeq2500, WinList from Verity Software House 
Data analysis BLAT alignment software, R v3.6.1, Bowtie2, HTSeq-Count, DESeq2, TCoffee, Expresso, MegAlign Pro (DNAstar 15.0.0), MrBayes3.2.1, 
FigTree 1.4.2, Biotapestry, Adobe Illustrator CS5.1 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers 
upon request. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


All raw sequencing data for all RNAseq libraries (Figure 3) and merged reference transcriptomes are available online (NCBI BioProject# PRJNA497902). Sequences of 
in situ probe templates for Figures 1B, 1C, 2A, and 2C are available through GenBank accession codes found in the methods. 
Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size For chicken RNAseq replicates, a pre-determined number of cells was collected per kit instructions for the SMART-seq v4 ultra low input 
RNA kit (Cranial Rep 1= 1534 cells, Cranial Rep 2= 1530 cells, Cranial Rep 3= 1527 cells; Trunk Rep 1= 1500, Trunk Rep 2= 721, Trunk Rep 3= 
958). For lamprey dissections, approximately 100 embryos were dissected for each replicate of each axial level for RNAseq libraries. No 
statistical tests were used to determine sample size. The sample size provided enough RNA for subsequent library preparation. 




Data exclusions No datasets were excluded from this analysis. 


Replication Chicken RNAseq libraries were collected in triplicate for biological replicates. Lamprey RNAseq libraries were also collected as biological 
replicates. All attempts at replication were successful. For in situ hybridization, embryos were pooled from different breeding 
pairs (fish), brooding stocks (skates), or embryo batches (lamprey) to ensure replication of results in multiple fixed collections. All in situ 
expression patterns were replicated 100% over multiple rounds of in situs. 


Randomization Chicken eggs obtained from a local chicken farm were incubated in multiple incubators over different days to account for inter-batch 
variability. Lamprey tissues were collected from different breeding events to account for inter-batch variability. 


Blinding Different batches of chicken embryos were separately incubated in different incubators and electroporated with fresh DNA reporter construct 


solution. Different breeding pairs for lamprey were used for tissue collections to blind for batch variability. For all trunk libraries, dissections 
were made at the same somitic levels and not based on reporter expression to ensure blinding in sample collection. 


Reporting for specific materials, systems and methods 


Materials & experimental systems          Methods 
n/a | Involved in the study               n/a | Involved in the study 
Unique biological materials               ChIP-seq 
Antibodies                                Flow cytometry 
Eukaryotic cell lines                     MRI-based neuroimaging 


Palaeontology 


Animals and other organisms 


Human research participants 


Antibodies 


Antibodies used anti-Digoxigenin-Alkaline Phosphatase Fab Fragments (Roche ref#11093274910) 


Validation From the supplier: The polyclonal antibody from sheep is specific to digoxigenin and digoxin and shows no cross-reactivity with 
other steroids, such as human estrogens and androgens. Concentrations of antibody used for each animal have been previously 
reported and validated as cited in the methods section. 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Adult (3 months - 2 years of age) Danio rerio, zebrafish, were maintained in the Beckman Institute Fish Facility at Caltech, and all 
animal and embryo work was compliant under approved IACUC protocol 1346. Adult zebrafish are bred via natural spawning, 
which is triggered by light-dark cycle. The fish are kept in a specialized recirculating aquatic system with mechanical, chemical, 
and biological filters to maintain water quality. The water is kept at 28 degrees C and highly oxygenated. 


Wild animals Gravid male and female sea lamprey (Petromyzon marinus) were caught in the wild and provided by the Great Lakes Fishery 
Commission, in cooperation with its partners at the USGS Hammond Bay Biological Station, USFWS Marquette Biological Station, 
and the Department of Fisheries and Oceans, Canada. They were sent overnight in chilled, oxygenated water to the lamprey 
facility at the California Institute of Technology, where they were maintained under the parameters set in accordance with the 
Guide for the Care and Use of Laboratory Animals of the National Institutes of Health, with protocols approved by the 
Institutional Animal Care and Use Committees of the California Institute of Technology (lamprey, Protocol #1436-17). After 
spawning, the captive adult lamprey died of natural causes. 


Field-collected samples Lamprey embryos were produced by in vitro fertilization at the California Institute of Technology lamprey facility, using captive 
gravid lamprey (Petromyzon marinus) provided by the Great Lakes Fishery Commission, in cooperation with its partners at the 
USGS Hammond Bay Biological Station, USFWS Marquette Biological Station, and the Department of Fisheries and Oceans, 
Canada. Lamprey were maintained with a water temperature of 10-18 degrees C on a 15:9h light:dark cycle. 


All skate embryos were collected from wild-caught brood stock housed at the Marine Resources Centre of the Marine Biological 
Laboratory (MBL) in Woods Hole. Eggs were maintained in a flow-through seawater system with a water temp of 15 degrees C, 
on a 12h:12h light:dark cycle. Prior to fixation, all embryos were euthanized with an overdose of buffered MS-222 (1g/L in 
seawater). All embryo collection was performed in accordance with protocols approved by the MBL Institutional Animal Care 
and Use committee. 
Fertilized chicken eggs were obtained from a local farm in Sylmar, CA. Developing chicken embryos were maintained at a 
temperature of 37 degrees C. 


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 

Sample preparation Chicken embryos were dissected in Ringers and washed thrice in chilled 1x PBS. The tissues were dissociated in Accumax 
(Innovative Cell Technologies, Inc.) for 15 minutes at 37 °C. 

Instrument Sony SY3200 Cell Sorter 

Software WinList from Verity Software House 


Cell population abundance Collected cells were obtained and analyzed on a hemocytometer for fluorescence and viability to ensure 100% purity. 


Gating strategy Gating was assigned according to standard protocols. Clear differentials between GFP+ and GFP- populations were observed. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
Green plants (Viridiplantae) include around 450,000-500,000 species of great 
diversity and have important roles in terrestrial and aquatic ecosystems. Here, as part 
of the One Thousand Plant Transcriptomes Initiative, we sequenced the vegetative 
transcriptomes of 1,124 species that span the diversity of plants in a broad sense 
(Archaeplastida), including green plants (Viridiplantae), glaucophytes (Glaucophyta) 
and red algae (Rhodophyta). Our analysis provides a robust phylogenomic framework 
for examining the evolution of green plants. Most inferred species relationships are 
well supported across multiple species tree and supermatrix analyses, but discordance 
among plastid and nuclear gene trees at a few important nodes highlights the 
complexity of plant genome evolution, including polyploidy, periods of rapid 
speciation, and extinction. Incomplete sorting of ancestral variation, polyploidization 
and massive expansions of gene families punctuate the evolutionary history of green 
plants. Notably, we find that large expansions of gene families preceded the origins of 
green plants, land plants and vascular plants, whereas whole-genome duplications are 
inferred to have occurred repeatedly throughout the evolution of flowering plants and 
ferns. The increasing availability of high-quality plant genome sequences and advances 
in functional genomics are enabling research on genome evolution across the green 
tree of life. 


Viridiplantae comprise an estimated 450,000-500,000 species, 
encompass a high level of diversity and evolutionary timescales, and 
have important roles in all terrestrial and most aquatic ecosystems. This 
ecological diversity derives from developmental, morphological and 
physiological innovations that enabled the colonization and exploitation 
of novel and emergent habitats. These innovations include multicel- 
lularity and the development of the plant cuticle, protected embryos, 
stomata, vascular tissue, roots, ovules and seeds, and flowers and fruit 
(Fig. 1). Thus, plant evolution ultimately influenced environments glob- 
ally and created a cascade of diversity in other lineages that span the 
tree of life. Plant diversity has also fuelled agricultural innovations and 
growth in the human population. 

Phylogenomic approaches are now widely used to resolve species 
relationships as well as the evolution of genomes, gene families and gene 
function. We used mostly vegetative transcriptomes for a broad taxonomic 
sampling of 1,124 species together with 31 published genomes to 
infer species relationships and characterize the relative timing of organ- 
ismal, molecular and functional diversification across green plants. 

We evaluated gene history discordance among single-copy genes. 
This is expected in the face of rapid species diversification, owing to 
incomplete sorting of ancestral variation between speciation events. 
Hybridization, horizontal gene transfer, gene loss following gene and 
genome duplications and estimation error can also contribute to 
gene-tree discordance. Nevertheless, through rigorous gene and species tree 
analyses, we derived robust species tree estimates (Fig. 2 and Supple- 
mentary Figs. 1-3). Gene-family expansions and genome duplications 
are recognized sources of variation for the evolution of gene function 


and biological innovations. We inferred the timing of ancient genome 
duplications and large gene-family expansions. Our findings suggest that 
extensive gene-family expansions or genome duplications preceded the 
evolution of major innovations in the history of green plants. 


Integrated analysis of genome evolution 


Because genome sizes vary by 2,340-fold in land plants and 4,680-fold 
in chlorophyte and streptophyte green algae, we used a 
reduced-representation sequencing approach to reconstruct gene and species 
histories. Specifically, we generated 1,342 transcriptomes representing 
1,124 species across Archaeplastida, including green plants, 
glaucophytes and red algae. Comparing phylogenetic inferences based 
on nuclear and plastid genes (Figs. 2, 3 and Supplementary Figs. 1-3), 
we obtained well-supported, largely congruent results across diverse 
datasets and analyses. Resolution of some relationships, however, was 
confounded by gene-tree discordance (Fig. 3), which is attributable to 
factors that include rapid diversification, reticulate evolution, gene 
duplication and loss, and estimation error. 

Inferred whole-genome duplications (WGDs; that is, polyploidy) 
across the gene-tree summary phylogeny estimated using ASTRAL 
were not uniformly distributed (Fig. 4, Supplementary Fig. 8 and 
Supplementary Table 2). Comparing distributions of gene duplication times 
for each species (Supplementary Table 3) and orthologue divergence 
times (Supplementary Table 4) with gene-tree analyses (Supplementary 
Tables 5, 6), we inferred 244 ancient WGDs across Viridiplantae 
(Supplementary Fig. 8 and Supplementary Table 2). Although there 


A list of participants and their affiliations appears in the online version of the paper. 
Fig. 1 | Diversity within the Viridiplantae. a-e, Green algae. a, Acetabularia sp. 
(Ulvophyceae). b, Stephanosphaera pluvialis (Chlorophyceae). c, Botryococcus 
sp. (Trebouxiophyceae). d, Chara sp. (Charophyceae). e, ‘Spirotaenia’ sp. 
(taxonomy under review) (Zygnematophyceae). f-p, Land plants. f, Notothylas 
orbicularis (Anthocerotophyta (hornwort)). g, Conocephalum conicum 
(Marchantiophyta (thalloid liverwort)). h, Sphagnum sp. (Bryophyta (moss)). 
i, Dendrolycopodium obscurum (Lycopodiophyta (club moss)). j, Equisetum 
telmateia (Polypodiopsida, Equisetidae (horsetail)). k, Parablechnum 
schiedeanum (Polypodiopsida, Polypodiidae (leptosporangiate fern)). l, Ginkgo 
biloba (Ginkgophyta). m, Pseudotsuga menziesii (Pinophyta (conifer)). 
n, Welwitschia mirabilis (Gnetophyta). o, Bulnesia arborea (Angiospermae, 
eudicot, rosid). p, Paphiopedilum lowii (Angiospermae, monocot, orchid). 
a, Photograph reproduced with permission of Thieme Verlag, Stuttgart. 
b-e, Photographs courtesy of M. Melkonian. f-j, l-n, p, Photographs courtesy of 
D.W.S. k, Photograph courtesy of R. Moran. o, Photograph courtesy of W. Judd. 


are limitations to the inference of WGD events using this approach, 
we found that comparisons of these results with 65 overlapping pub- 
lished genome-based WGD inferences revealed 6 false-negative results 
in our tree-based estimates and no false-positive results (Supplementary 
Table 2). Analyses based on whole-genome sequences are needed for 
further resolution of WGD events. 
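
The benchmark reported above implies a simple detection rate, worked through in the sketch below. It assumes that all 65 published genome-based inferences are treated as true events, which the text does not state explicitly; the function name `detection_rate` is hypothetical.

```python
def detection_rate(n_reference, n_missed):
    """Fraction of reference events recovered: (reference - missed) / reference."""
    if n_reference <= 0:
        raise ValueError("n_reference must be positive")
    if not 0 <= n_missed <= n_reference:
        raise ValueError("n_missed must lie between 0 and n_reference")
    return (n_reference - n_missed) / n_reference


# 65 overlapping published inferences, 6 of them missed by the tree-based method.
rate = detection_rate(65, 6)  # 59 / 65

if __name__ == "__main__":
    print(f"{rate:.3f}")
```

Under that assumption the tree-based approach recovered 59 of 65 events, or roughly 91%, and the absence of false positives means every overlapping event it did call was corroborated by genome data.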

With the exception of most Selaginella species and some liverworts 
(Fig. 1g), our analyses implicated at least one ancient WGD in the ances- 
try of every land plant lineage. By contrast, most algal lineages showed 
no evidence of WGD. Notably, the predicted sister clade of land plants 
(Fig. 2), Zygnematophyceae (Fig. 1e), exhibited the highest density of 
WGDs among algal lineages (Fig. 4), although the apparent increase in 
WGD was largely restricted to the desmid clade (Desmidiales) within 
Zygnematophyceae. 

Increased diversification rates did not precisely co-occur with WGDs 
on the phylogeny. WGDs are expected to contribute to the evolution of 
novel gene function. For example, novel functions among duplicate 
MADS-box genes that arose through WGD have been linked to the origin 
of flowering plants and core eudicots, and functional diversification 
of gene families after WGD has contributed to the evolution of 
fruit colour in tomato species and to nodule development within 
legumes. Consistent with previous studies with less extensive taxon 
sampling, however, we inferred lags between WGDs and increased 
species diversity. Integrated phylogenomic and functional investigations 
are required to gain a mechanistic understanding of the lag 


680 | Nature | Vol574 | 31 October 2019 


between WGD, the evolution of novel gene functions and their potential 
influence on diversification rates. 

Gene-family expansions (and contractions) contribute to the dynamic 
evolution of metabolic, regulatory and signalling networks. Given 
the inherent limitations of transcriptome data, we searched for 
large-fold changes in 23 of the largest gene families in Arabidopsis thaliana 
that are involved in many important functions (such as transcriptional 
regulation, enzymatic and signalling function, and transport; Fig. 5 and 
Supplementary Tables 7, 8). Although our RNA-sequencing-based sampling 
of expressed genes is incomplete, the median representation of 
universally conserved genes was 80-90% for taxa across Viridiplantae 
(Extended Data Fig. 3a, b). Furthermore, there was a strong correlation 
(r = 0.95) between gene-family sizes in our transcriptomes (focusing 
on the largest gene families) and those of fully sequenced genomes 
(Extended Data Fig. 3c-f). We identified gene-family expansions and 
contractions, including some that have been described previously. 
Specifically, the AP2, bHLH, bZip and WRKY transcription factor families 
were inferred to be present in the last common ancestor of Viridiplan- 
tae, whereas the origin of GRAS and NAC genes occurred in early strep- 
tophytes after divergence from the chlorophyte algal lineage (Fig. 5). 
The highest concentration of expansion events was inferred along the 
‘spine’ of the phylogeny between the origins of Viridiplantae and vascular 
plants (Fig. 5b and Supplementary Table 7). Expansions of some focal 
gene families also continued after the origin of embryophytes; however, 
no expansions occurred in association with the origin and radiation of 
angiosperms (Fig. 5). Gene-family expansions and functional diversi- 
fication may have contributed to the adaptations required for life in 
terrestrial habitats, but the sizes of these focal gene families apparently 
stabilized in the face of continued gene duplication and loss throughout 
the evolution of vascular plants. 
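
The Fig. 5 legend defines an expansion as a gain of more than 50% and a contraction as a loss of more than 33%, with sizes weighted by 1/(maximum observed gene-family size). The following Python sketch illustrates only those two rules. The function names, the group labels and the example copy numbers are hypothetical, and the sketch omits the Kolmogorov-Smirnov significance test that the legend also requires.

```python
def normalize_sizes(sizes_by_group):
    """Scale each group's gene-family size by 1 / (maximum observed size)."""
    largest = max(sizes_by_group.values())
    return {group: size / largest for group, size in sizes_by_group.items()}


def classify_change(size_before, size_after, gain=0.50, loss=0.33):
    """Label a size change between two groups using the Fig. 5b thresholds."""
    change = (size_after - size_before) / size_before
    if change > gain:
        return "expansion"      # gain of more than 50%
    if change < -loss:
        return "contraction"    # loss of more than 33%
    return "no call"


# Hypothetical mean copy numbers for one gene family in three species groups.
sizes = {"group_A": 10.0, "group_B": 16.0, "group_C": 9.0}
normalized = normalize_sizes(sizes)
calls = [
    classify_change(normalized["group_A"], normalized["group_B"]),
    classify_change(normalized["group_B"], normalized["group_C"]),
]
print(calls)
```

The two thresholds are roughly reciprocal fold changes: a 50% gain is a factor of 1.5, and a 33% loss leaves about 1/1.5 of the original size.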


Primary acquisition of the plastid 

The primary acquisition of the plastid in an ancestor of extant Archae- 
plastida was a pivotal event in the history of life. All possible relation- 
ships among Viridiplantae, Glaucophyta and Rhodophyta have been 
hypothesized, with alternative implications for the gain and loss of 
characters* in the early history of the three lineages. Strong support for 
the sister relationship of Viridiplantae and Glaucophyta® (Figs. 2, 3a) 
found here indicates that ancestral red algae lost flagella and peptidoglycan 
biosynthesis, perhaps associated with a reduction in genome 
size**. Peptidoglycan biosynthesis was independently lost early in the 
evolution of Chlorophyta” and within angiosperms*. 


The history of Viridiplantae 

The origin of Viridiplantae is marked by the loss of light-harvesting 
phycobilisomes composed of phycobiliproteins, the evolution of the 
accessory photosynthetic pigment chlorophyll b, which has a distinct 
light-absorption spectrum relative to chlorophyll a, and intraplastidial 
starch synthesis and deposition. Viridiplantae are consistently recovered 
as monophyletic, with early diverging Chlorophyta and Streptophyta 
lineages*” *. However, the placement of the picoplanktonic algal lineage 
Prasinococcales was unstable in our analyses (Fig. 3e). 


Diversification within Chlorophyta 

All nuclear-gene analyses resolved a grade of largely marine unicellular 
lineages subtending the core clade consisting of Trebouxiophyceae, 
Ulvophyceae and Chlorophyceae” (Fig. 1a-c and Supplementary Figs. 1-3). 
The nuclear supermatrix and ASTRAL trees placed Trebouxiophyceae 
as sister to a clade containing Chlorophyceae and Ulvophyceae*?*’. 
However, whereas the supermatrix trees supported Ulvophyceae as 
monophyletic, the ASTRAL tree resolved Ulvophyceae as a grade, with 
Bryopsidales poorly supported as sister to Chlorophyceae (Fig. 3h). All 
tree estimates suggest that there were multiple origins of multicellularity 
within Ulvophyceae. Only 12 out of 119 sampled chlorophyte species 
exhibited evidence of a WGD in their ancestry, and most of these putative 
WGDs were restricted to single clades. 


Streptophyta 

The evolution of streptophytes was associated with several adaptations 
to terrestrial habitats****. All analyses recovered Mesostigma, 
Chlorokybus and Spirotaenia minuta in a clade that is sister to the 
remainder of Streptophyta® with successive divergence of Klebsormidiales, 
Charophyceae (Fig. 1d), Coleochaetophyceae and Zygnematophyceae 
(Fig. 1e) relative to Embryophyta. However, with greatly 
increased taxon sampling relative to our previous work”, internal 
branch lengths are diminished, and we could not reject the possibility 
of a true radiation giving rise to Coleochaetales, Zygnematophyceae 
and embryophyte lineages (land plants; Figs. 1f-p, 3g(II)). Although 
quartet support for a clade of Coleochaetales and Zygnematophyceae 
as sister to embryophytes was similar to support for Zygnematophyceae 
as sister to embryophytes, a clade consisting of Coleochaetales 
and land plants was not supported. 


Embryophyta 

Land plants include many of the most familiar green plants (for example, 
bryophytes (Fig. 1f-h), lycophytes (Fig. 1i), ferns (Fig. 1j, k) and seed 
plants (Fig. 1l-p)). They exhibit key innovations, including protected 
reproductive organs (archegonia and antheridia) and the development 
of the zygote within an archegonium into an embryo that receives maternal 
nutrition. Resolving relationships among bryophytes (mosses, liverworts 
and hornworts) and their relationships to the remaining land 
plants has long been problematic, but is critical for understanding the 
evolution of fundamental innovations within land plants, including 
the tolerance to desiccation, shifts in the dominance of multicellular 
haploid and diploid generations, and parental retention of a multicellular 
embryo. 

Fig. 2 | Phylogenetic inferences of major clades. Phylogenetic inferences were 
based on ASTRAL analysis of 410 single-copy nuclear gene families extracted 
from genome and transcriptome data from 1,153 species, including 1,090 green 
plant (Viridiplantae) species (Supplementary Table 1). a, Phylogram showing 
internal branch lengths proportional to coalescent units (2Ne generations) 
between branching events, as estimated by ASTRAL-II* v.5.0.3. b, Relationships 
among major clades with red box outlining flowering plant clade. Species 
numbers are shown for each lineage. Most inferred relationships were robust 
across data types and analyses (Supplementary Figs. 1-3) with some exceptions 
(Supplementary Fig. 6). Data and analysis scripts are available at 
https://doi.org/10.5281/zenodo.3255100. 

Bryophytes have sometimes been resolved as a grade*”“®, with liver- 
worts, mosses and hornworts as successive sister groups to Tracheo- 
phyta (vascular plants; Fig. 1i-p). We recovered extant bryophytes as 
monophyletic in the ASTRAL analysis of nuclear gene trees (Fig. 3b) 
and plastome analyses, with hornworts sister to a moss and liverwort 
clade. All analyses rejected the hypothesis that liverworts are sister to 
all other extant land plant lineages’. 

The largest number of gene-family expansions in our analyses was 
associated with the origin of land plants and the evolution of bryophytes 
(transition between streptophyte algae and bryophytes in Fig. 5b). By 
contrast, we found no evidence of WGD on the stem branch for land 
plants (Supplementary Tables 5, 6). 


Vascular plants 

Within the vascular plants, lycophytes are supported as the sister group 
of Euphyllophyta (ferns and seed plants). We found no evidence of 
pan-vascular-plant or ancestral euphyllophyte WGDs, but some 
gene-family expansions were associated with the origin of vascular 
plants (Fig. 5b). 

Within ferns (Polypodiopsida), plastid data weakly support Equise- 
tales as sister to Psilotales and Ophioglossales (Supplementary Fig. 3), 
whereas nuclear gene analyses robustly place Equisetales sister to the 
remaining ferns*°. The supermatrix and plastome-based trees placed 
Marattiales sister to the leptosporangiate ferns*° (Polypodiidae), but 
ASTRAL recovers nearly equal quartet support for this hypothesis or 
for Marattiales as sister to Psilotales and Ophioglossales (Fig. 3f). 
Leptosporangiate ferns (Fig. 1k) experienced more WGD events than any 
other lineage of Viridiplantae outside the angiosperms, with an average 
of 3.79 inferred WGDs in the history of each sampled species (Fig. 4). 
WGD was inferred in an ancestor of all extant ferns and an additional 
19 putative WGDs were implicated in the ancestry of fern subclades 
(Ophioglossaceae and Polypodiaceae; Fig. 4, Supplementary Fig. 8 
and Supplementary Tables 2, 5, 6). Considering the high chromosome 
numbers of some ferns, our discovery that they exhibit one of the highest 
frequencies of palaeopolyploidization among green plants is not 
unexpected”. 

Whereas none of our focal gene families exhibited significant expansion 
in ferns, significantly more MIKC-type MADS-box genes, which are involved 
in specification of ovule and flower development in seed plants*, were 
observed in leptosporangiate ferns relative to all other green plant lineages, 
other than seed plants (Extended Data Fig. 1). The ancestral number 
of MIKC-type MADS-box genes for ferns and seed plants was 4 or 5, and 
gene numbers increased independently within leptosporangiate ferns 
and seed plants (Extended Data Figs. 1, 2). 


Seed plants 

A WGD in the ancestry of all extant seed plants has been inferred previously'*? 
but remains contested™. Gene-tree’® analyses revealed 
significantly more gene duplications on the branch leading to extant 
seed plants than expected from background gene birth and death rates 
(analyses D1 (P< 2.0 x 10%) and D2 (P< 8.9 x 10°) in Supplementary 
Table 5). Numerous gene-family expansions were also associated with 
the origin of seed plants, and only one contraction was detected among 
the gene families analysed (Fig. 5b). Type II MIKC-type MADS-box genes 
exhibited a nearly twofold expansion independent of their expansion 
in ferns (Extended Data Figs. 1, 2). 

Fig. 3 | Alternative branching orders for contentious relationships. Local 
posterior probabilities (shown only when below 1.0) and gene-tree quartet 
frequencies (bar graphs) for alternative branching orders for contentious 
relationships in the plant phylogeny (see text). a, Early Archaeplastida 
diversification. b, Early embryophyte diversification. c, Gymnosperms. d, Early 
angiosperm diversification. e, Early Viridiplantae diversification. f, Early fern 
diversification. g, The sister lineage to land plants. h, Trebouxiophyceae, 
Ulvophyceae and Chlorophyceae. i, Eudicot diversification. Red bars represent 
the ASTRAL topology; blue and yellow trees and bars represent the frequencies 
of alternative branching orders in ASTRAL. The topologies recovered in the 
concatenated supermatrix analysis and plastid gene analyses are also indicated. 
Dashed horizontal lines mark expectation for a hard polytomy (purple). 
In g-i, panels include more than 4 tips, so nodes are delineated with Roman 
numerals and bar graphs are shown for each node and asterisks above branches 
indicate failure to reject the hypothesis that the node is a polytomy. Data and 
analysis scripts are available at https://doi.org/10.5281/zenodo.3255100. 

Extant gymnosperms (approximately 1,000 species) are sister to 
flowering plants, and all of our analyses recovered Cycadales and Ginkgo 
(Fig. 1l) as a sister clade to the remaining gymnosperms (Fig. 3c). The 
placement of Gnetales conflicts strongly among the ASTRAL, superma- 
trix and plastome-based trees. Plastid data strongly support the ‘Gnecup’ 
hypothesis, with Gnetales as sister to a clade comprising Araucariales 
and Cupressales”, whereas the supermatrix analysis of nuclear genes 
supports a ‘Gnepine’ hypothesis with Gnetales as sister to Pinales>”°. 
ASTRAL analyses strongly support the ‘Gnetifer’ hypothesis, with coni- 
fers (Araucariales, Cupressales and Pinales) sister to Gnetales®. The 
short internal branches in the ASTRAL tree suggest rapid diversification 
(Fig. 2). However, the uneven frequencies of gene-tree quartets—which 
support the alternative Gnecup and Gnepine hypotheses—suggest 
that gene-tree estimation biases* associated with increased substitution 
rates in Gnetales*’ or gene flow are possible sources of gene-tree 
discordance®. Previously inferred WGDs in ancestors of Welwitschia, 
Pinaceae and Cupressales'’ are supported, as is a new inference of WGD in 
the ancestry of Podocarpaceae (Fig. 4 and Supplementary Tables 2, 5, 6). 

Angiosperms are by far the largest clade of green plants (more than 
370,000 species’) and are marked by multiple key innovations, including 
the carpel, double fertilization, endosperm, and for most angiosperms, 
vessel elements. Both nuclear and plastid phylogenomic analyses agree 
with previous studies” in providing strong support for angiosperm 
monophyly and in placements of Amborellales, Nymphaeales and Austrobaileyales 
as successive sisters to all other angiosperms (Figs. 2, 3). 
Chloranthales and magnoliids comprise a clade in the ASTRAL and supermatrix 
analyses, but were resolved with poor support as successive sister 
lineages to all other Mesangiospermae (monocots, Ceratophyllum and 
eudicots) in the plastome-based tree. Whereas Ceratophyllum is sister 
to eudicots in the ASTRAL and plastome trees, it is poorly supported as 
sister to monocots in the supermatrix tree (Supplementary Figs. 1-3). All 
analyses suggest short time intervals between branching of the monocots, 
Magnoliidae, Chloranthales, Ceratophyllales and eudicot lineages 
in early mesangiosperm history (Fig. 2 and Supplementary Figs. 1-3). 

Pentapetalae (70% of all angiosperms) are marked by the evolution 
of the pentamerous flower. Substantial gene-tree discordance was 
observed for relationships among core rosids, Saxifragales, Vitales, 
Dillenia, Santalales, Berberidopsidales, Caryophyllales, asterids and 
Gunnerales (the sister group of Pentapetalae; Fig. 3i). Short internal 
branches and poor support in the ASTRAL tree at the base of the core 
eudicots (Figs. 2, 3i) indicate rapid diversification following two rounds 
of WGD that resulted in palaeohexaploidy preceding the origin of the 
clade®' (Supplementary Fig. 8). The supermatrix and plastid trees 
conflict with the poorly supported ASTRAL branching order (Fig. 3i). 
With the exception of the Berberidopsidales and core asterid clade, 
we were not able to reject the possibility of polytomies at the evaluated 
nodes in ASTRAL analyses (Fig. 3i). 

Fig. 4 | The distribution of inferred ancient WGDs across lineages of green 
plants. a, The locations of estimated WGDs are labelled red in the phylogeny of all 
1000 Plants (1KP) samples. b, The number of inferred ancient polyploidization 
events within each lineage is shown in the violin plots. The white dot indicates the 
median, the thick black bars represent the interquartile range, the thin black lines 
define the 95% confidence interval and the grey shading represents the density of 
data points. The sample sizes for each lineage are shown within parentheses along 
with taxon names on the phylogeny. The phylogenetic placement of inferred 
WGDs is illustrated in Supplementary Fig. 8 and data supporting each WGD 
inference are provided in Supplementary Table 2. 

Genomic and phylogenomic analyses have identified numerous WGDs 
throughout angiosperm history”. We found evidence that extant 
flowering plants descend from a polyploid common ancestor’. Gene-tree 
analyses detected a significantly larger-than-background number 
of gene duplications on the branch leading to the last common ancestor 
of extant angiosperms after divergence from the extant gymnosperm 
clade (analyses E1 (P< 1.8 x 10) and E2 (1.4 x 10*) in Supplementary 
Table 5). Furthermore, the numbers of inferred duplications on the 
stem branch of angiosperms were consistent with expectations for 
WGD (analyses E1 and E2 in Supplementary Table 6). We inferred over 
180 WGDs within flowering plants, including 132 in eudicots and 35 in 
monocots (Supplementary Table 2). 

The origin of the angiosperms was preceded by three focal gene-family 
contractions and no expansions (Fig. 5b), consistent with the hypothesis 
that the innovations in angiosperms may have involved the functional 
co-option of genes that were duplicated earlier in the evolution of seed 
plants”. We find that orthologues of some floral homeotic MADS-box 
genes originated in the stem group of extant seed plants approximately 
300 million years ago (Extended Data Fig. 2), supporting the hypothesis 
that the origin of the angiosperm flower involved recruitment of devel- 
opmental regulators that already existed in their seed plant ancestors’. 


Synthesis 

These analyses establish a foundation for advancing our understanding 
of the overall phylogenetic framework of green plants and the genetic 
changes that were responsible for the characteristic traits associated 
with major evolutionary transitions in Viridiplantae. Portions of the 


Nature | Vol574 | 31October 2019 | 683 


Article 


a 
2507 
200 4 
gs 
i 
E 
Ss 1504 
wo @ 
2 5 
Lo>) 
= 
32 1004 
25 
504 
oJ 
aoa =F g ee a CS © ¢ a rey oS cS cS 
NX pos Q Soi LO in i fey) Ni: oO . nN — 
ssk a @ La0 oO 2 6 @ re x Oo x 3 
@ 2 Ss Got yg € €& 6 S ae} 7) ® a o 
ef&ao £ 2> 3938 o 5 & 9 o z sf Ke} 3 
E aio rom eo S68 ir 8 ig fo) a: 5 = 8 < 
FSS 8&8 2 535 gE 5 a z 2 é 5 
5 x 2 Rint 2 og $ + gz [om oO 
Fe & |e > E E 3 
e 2 > a a 
a o oe) o) 
b : L 
i | ABC 
| | | ABI3VP1 
; == AP2/EREBP 
Bsirth | | 1 “| bHLH 
2 | | bZIP 
Ss | | |C2H2 
a Bs a C3H 
g | a 
ee OLY —_ Gros 
= 8 Be = GRAS 
we 4 a a a NS! (<1 
@8 2 | | HB 
55 = MADS box 
S5 | |MYB 
cS i NAG 
3 0.500 a a NBS LRR 
ir 0.250 } I P450s 
ffo.125 {oy PHD 
| | PP2C 
| | PPR P class 
| | PPR PLS class 
| | _|Receptor-like kinases 
| } 


Fig. 5 | Assessment of significant expansions and contractions of largest 
plant gene families. a, Weighted average gene-family size for species groups 
(normalized to account for differences in gene-family sizes, weight =1/ 
(maximum observed gene-family size)). The ANA grade comprises Amborellales, 
Nymphaeales and Austrobaileyales, successive sister lineages to a clade with the 
remaining extant angiosperms; the ‘CRPT+B’ grade includes Ceratophyllales, 
Ranunculales, Proteales lineages and a Trochodendrales + Buxales clade in the 
ASTRAL tree (Fig. 2). Sample sizes are proportional to bar widths (from left to 
right, n=23 (Chromista), 18 (Rhodophyta), 2 (Glaucophyta), 94 (Chlorophyta), 


species tree reported here remain unresolved. Phylogenetic analyses 
of genes extracted from a broad sampling of whole-genome sequences 
may improve gene family circumscriptions and resolve the species tree 
further. Expanded genome sequencing may also help to accurately 
account for interspecific gene flow, and orthology in the face of gene 
duplications and losses. However, for some nodes in the species tree, 
extensive discordance among inferred gene histories suggests that 
rapid diversification may not always conform to strict bifurcation of 
ancestral species into two descendent species. 

Gene and genome duplications have long been considered a source 
of evolutionary novelty”, producing an expanded molecular reper- 
toire for adaptive evolution of key pathways and shifts in plant devel- 
opment and ecology. However, the direct connections between key 
innovations and specific gene duplications are rarely known, due 
in part to lag times between duplications and such inovations™ ””. 
Phylogenetically informed experimental investigations of changes 
in gene content and function will improve our understanding of the 
roles of gene and genome duplications in the evolution of key innova- 
tions. Such efforts are underway, drawing on an expanding number 
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42 (streptophyte algae), 7 (hornworts), 18 (liverworts), 38 (mosses), 

16 (lycophytes), 59 (ferns; monilophytes), 76(gymnosperms), 6 (ANA grade), 

96 (monocots), 1 (*representing Chloranthales), 22 (magnoliids), 29 (CRPT+B grade), 
205 (asterids), 48 (caryophyllids), 176 (rosids), 23 (Saxifragales) and 6 (Santalales). 

b, Gene families exhibiting significant copy number changes (two-sided 
Kolmogorov-Smirnov test; P<1 10°; gene-family expansions represent a gain of 
more than 50% and contractions represent a loss of more than 33%) with colour codes 
showing the magnitude of the observed fold changes. Data and analysis scripts are 
available at https://github.com/GrosseLab/OneKP-gene-family-evo. 


of experimental model species distributed across the green plant 
tree of life®. 
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Methods 


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized, although simulations included in 
the genome duplication analyses did include drawing from random 
distributions. The investigators were not blinded to allocation during 
experiments and outcome assessment. 


Transcriptome sequencing 

RNA was isolated from young vegetative tissue from all of the species 
that were included in our phylogenomic analyses as described elsewhere®”*”*8. 
Reproductive tissues were also included for some species 
(Supplementary Table 1). Transcript assembly, contaminant identifica- 
tion and gene-family circumscription were also performed as described 
previously” and are described in more detail in the Supplementary 
Methods. 


Phylogeny reconstruction 

Analyses were performed on single-copy gene trees using ASTRAL to 
account for variation among gene trees owing to incomplete lineage 
sorting’. ASTRAL analyses were performed on gene trees estimated 
from unbinned amino acid alignments, first and second codons, statisti- 
cally binned supergenes with unweighted bins”’” and filtered taxon sets 
(excluding ‘rogue’ taxa as described below), with filtering of gene-tree 
bootstrap support thresholds of up to 33% to see whether the effects of 
gene-tree estimation error could be reduced (Supplementary Fig. 6). Binning 
left the majority of genes in singleton bins and had minimal effects 
on the overall species tree. Unless otherwise specified, we use ‘ASTRAL 
topology’ to refer to the tree inferred from 410 unbinned amino acid 
alignments in which branches with 33% or less support are contracted. 
In addition, supermatrix analyses were performed on concatenated 
nuclear gene alignments and concatenated plastid gene alignments 
compiled using previously described methods”. All scripts used to 
perform analyses on the nuclear gene data are available at 
https://doi.org/10.5281/zenodo.3255100. 


Multiple sequence alignment and data filtering. We built a multiple 
sequence alignment based on predicted amino acid sequences of each 
gene and forced DNA sequences to conform to the amino acid alignment. 
We first divided sequences in each gene into two subsets, full-length 
and abnormal sequences, and then used PASTA” with default settings 
to align full-length sequences and UPP” to add abnormal sequences to 
the full-length alignment. We designated as abnormal any sequence that 
was 66% shorter or 66% longer than the median length of the full-length 
gene sequences. Once UPP alignments were obtained, we removed from 
them all unaligned (that is, insertion) sites. DNA alignments were then 
derived from amino acid sequence alignments (FAA2FNA) and third 
codon positions were removed owing to extreme among-species vari- 
ation in GC content (Supplementary Fig. 7). To reduce running time, we 
then masked all sites from the alignment that contained more than 90% 
gaps. Finally, because the inclusion of fragmentary data in gene-tree 
estimation can be problematic”, we removed any sequence that had 
a gap for at least 67% of the sites in the site-filtered alignment (the 67% 
threshold was chosen based on simulation results”). Gene sequence 
occupancy for 410 single-copy genes in the 1,178 accessions used in our 
analyses is displayed as a frequency histogram (Supplementary Fig. 4) 
and a heat map (Supplementary Fig. 5). 
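
The three length and gap filters described above can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' pipeline: the function names are hypothetical, the median is taken over all input sequences rather than over the full-length subset, and '66% shorter or longer' is read as a length outside 0.34 to 1.66 times the median.

```python
from statistics import median


def split_by_length(seqs, tol=0.66):
    """Split sequences into full-length and abnormal sets by length.

    Assumption: 'abnormal' means more than `tol` (66%) shorter or longer
    than the median length of all input sequences.
    """
    med = median(len(s) for s in seqs.values())
    full, abnormal = {}, {}
    for name, s in seqs.items():
        if (1 - tol) * med <= len(s) <= (1 + tol) * med:
            full[name] = s
        else:
            abnormal[name] = s
    return full, abnormal


def mask_gappy_sites(aln, max_gap_frac=0.90):
    """Drop alignment columns in which more than 90% of rows are gaps."""
    names = list(aln)
    ncol = len(aln[names[0]])
    keep = [
        j for j in range(ncol)
        if sum(aln[n][j] == "-" for n in names) / len(names) <= max_gap_frac
    ]
    return {n: "".join(aln[n][j] for j in keep) for n in names}


def drop_fragments(aln, min_gap_frac=0.67):
    """Remove sequences that are gaps at 67% or more of the retained sites."""
    return {
        n: s for n, s in aln.items()
        if len(s) == 0 or s.count("-") / len(s) < min_gap_frac
    }


# Toy alignment: column 3 is all gaps, and sequence "c" is mostly gaps.
toy = {"a": "AC-GT", "b": "AC-G-", "c": "---G-"}
filtered = drop_fragments(mask_gappy_sites(toy))
print(filtered)
```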

In addition to filtering gappy sites and fragmentary sequences, we 
identified and removed sequences that were placed on extremely long 
branches on their respective gene trees. To identify these, we used the 
initial alignments to build gene trees (see below). We then rooted each 
gene tree by finding the bipartition that separated the largest exclusive 
group of outgroup or red algae taxa. If red algae were entirely missing 
for the gene, we used Glaucophyta, Prasinococcales, prasinophytes, 
Volvox carteri, Chlamydomonas reinhardtii or Klebsormidium nitens. 
We then removed any sequences that had a root-to-tip distance that was 
four standard deviations longer than the median root-to-tip distance in 
each gene tree. Once these sequences on long branches were removed, 
alignments were re-estimated using the same approach described above, 
and new gene trees were estimated. 
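
As a minimal sketch of the long-branch rule, the Python function below flags tips whose root-to-tip distance exceeds the median by more than four standard deviations. The function name and the input format (a mapping from tip name to distance, already computed from a rooted gene tree) are assumptions made for illustration, as is the use of the sample standard deviation.

```python
from statistics import median, stdev


def long_branch_outliers(root_to_tip, n_sd=4.0):
    """Return tips whose root-to-tip distance is > median + n_sd * SD.

    `root_to_tip` maps tip name -> distance for one rooted gene tree.
    Assumption: the sample standard deviation is used.
    """
    dists = list(root_to_tip.values())
    cutoff = median(dists) + n_sd * stdev(dists)
    return {name for name, d in root_to_tip.items() if d > cutoff}


# Toy example: 30 ordinary tips and one tip on a very long branch.
distances = {f"t{i}": 1.0 for i in range(30)}
distances["long"] = 100.0
print(long_branch_outliers(distances))
```

With few tips a single outlier inflates the standard deviation enough that nothing exceeds the cutoff, so a rule of this kind is only effective on densely sampled gene trees.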


Gene-tree estimation. To estimate gene trees, we used RAxML v.8.1.17”, 
with one starting tree for building initial trees (used for long-branch 
filtering) and 10 different starting trees for final gene trees. Support 
was assessed with 100 replicates of bootstrapping. For DNA analyses, 
the GTR substitution model and the GAMMA-distributed site rates were 
used. For amino acid sequences, we used a Perl script adapted from the 
RAxML website to search among 16 different substitution models on a 
fixed starting tree per gene and chose the model with the highest likelihood 
(JTT, JTTF or JTTDCMUT were selected for 349 out of 410 genes). 
For amino acid trees, we also used the GAMMA-distributed site rates. 


Species tree estimation. We used ASTRAL-II* v.5.0.3 to estimate the 
species tree on the basis of all 410 genes; using 384 genes that each included 
at least half of the species changed only 3 low-support branches. 
We used multi-locus bootstrapping’”” and the built-in local posterior 
probabilities of ASTRAL to estimate branch support” and to test for 
polytomies”, drawn on species trees estimated based on the maximum- 
likelihood gene trees. We also used the built-in functionality of ASTRAL 
(version 4.11.2) to compute the percentage of gene trees that agreed 
with each branch in the species tree, by finding the average number 
of gene-tree quartets defined around the branch (choosing one taxon 
from each side) that were congruent with the species tree and used 
DiscoVista® to visualize them (Fig. 4). Median representation of each 
species across the 410 single-copy gene trees was 82.4% with 88.2% and 
67.1% of species having assemblies for at least 50% or 75% of the 410 
single-copy genes, respectively. A large body of work on phylogenetic 
methodologies has established that gene and species tree estimation can 
be robust to missing data, particularly with dense taxon sampling”. 
Recent papers have even established statistical consistency under missing 
data®*. Similar evidence of robustness also exists in the context of 
concatenated analyses** ®°. 
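
To illustrate the quartet-based support measure, the Python sketch below tallies how often each of the three possible resolutions of a four-taxon set appears across gene trees. It assumes the quartet topology has already been read from each gene tree; the function names and input format are hypothetical and this is not ASTRAL's implementation. Under a hard polytomy the three resolutions are expected at equal frequency (one-third each).

```python
from collections import Counter


def canonical(quartet):
    """Order-independent form of a quartet split such as AB|CD."""
    left, right = quartet
    return frozenset({frozenset(left), frozenset(right)})


def quartet_frequencies(gene_tree_quartets):
    """Fraction of gene trees supporting each quartet resolution."""
    counts = Counter(canonical(q) for q in gene_tree_quartets)
    total = sum(counts.values())
    return {topology: n / total for topology, n in counts.items()}


# Toy input: one resolved quartet per gene tree for taxa A, B, C and D.
quartets = (
    [(("A", "B"), ("C", "D"))] * 5
    + [(("B", "A"), ("D", "C"))]          # same split, different order
    + [(("A", "C"), ("B", "D"))] * 3
    + [(("A", "D"), ("B", "C"))]
)
freqs = quartet_frequencies(quartets)
print(freqs[canonical((("A", "B"), ("C", "D")))])
```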

All supermatrix analyses are based on the filtered amino acid and 
first and second codon position alignments that included at least half 
of the species for 384 genes. The (1) unfiltered supermatrices used the 
gene alignments as is; the (2) eudicot supermatrices retained only eudicot 
species in the supermatrix; and the (3) rogue-filtered supermatrices 
excluded eight ‘rogue’ taxa (Dillenia indica, Tetrastigma obtectum, Tetrastigma 
voinierianum, Vitis vinifera, Cissus quadrangularis, ‘Spirotaenia’ sp., 
Ceratophyllum demersum and Prasinococcus capsulatus) that varied 
in placement among our full ASTRAL, supermatrix and plastid genome 
analyses. Well-supported branching orders were stable among analyses 
(Supplementary Fig. 6). 

Maximum-likelihood supermatrix analyses were performed using 
ExaML v.3.0.14°*”. Similar to the gene-tree analyses, the GAMMA model 
of rate heterogeneity across sites was used for all maximum-likelihood 
supermatrix analyses. To better handle model heterogeneity across 
genes, we divided the supermatrix into partitions. For the amino acid 
alignments, the protein model selected for each gene family in the gene- 
tree estimation process was used to group genes into partitions, creating 
one partition per substitution model. For the nucleotide alignments, 
we estimated the GTR transition rate parameters and the alpha shape 
parameter for each codon position (first and second positions) of each 
alignment using RAxML v.8.1.21”. We then projected the maximum-likelihood 
parameter values for each gene into a two-dimensional plane 
using principal component analysis®®. We performed k-means cluster- 
ing®’ in R® to group the codon positions into partitions, selecting k=8, 
which accounted for 80% of the variation. Trees derived from nucleotide 
alignments can be found at https://doi.org/10.5281/zenodo.3255100. 
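
The partitioning step can be illustrated with a plain k-means (Lloyd's algorithm) on points that are assumed to be already projected onto two principal components. This Python sketch is not the R code used in the study: the function names are hypothetical, and the deterministic farthest-first initialization is an assumption chosen to keep the example reproducible.

```python
def dist2(p, q):
    """Squared Euclidean distance between two 2-D points."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2


def kmeans(points, k, n_iter=100):
    """Cluster 2-D points with Lloyd's algorithm; return (labels, centers)."""
    # Farthest-first initialization (an assumption; deterministic).
    centers = [points[0]]
    while len(centers) < k:
        centers.append(
            max(points, key=lambda p: min(dist2(p, c) for c in centers))
        )
    labels = None
    for _ in range(n_iter):
        # Assignment step: each point joins its nearest center.
        new_labels = [
            min(range(k), key=lambda i: dist2(p, centers[i])) for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each center to the mean of its members.
        for i in range(k):
            members = [p for p, lab in zip(points, labels) if lab == i]
            if members:
                centers[i] = (
                    sum(x for x, _ in members) / len(members),
                    sum(y for _, y in members) / len(members),
                )
    return labels, centers


# Toy data: two well-separated groups of projected parameter values.
pts = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
labels, centers = kmeans(pts, k=2)
print(labels)
```

In the analysis described above, each point would correspond to one codon position of one gene and k would be 8.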


To examine the influence of the starting tree on the likelihood of the 
final tree, we performed preliminary analyses on an earlier version of 
our supermatrices. We generated nine different maximum-parsimony 
starting trees using RAxML v.8.1.21 and one maximum-likelihood starting 
tree using FastTree-2 v.2.1.5”. We then ran ExaML on each of the 
starting trees, noting the final maximum-likelihood score. We found that 
in all cases, the ExaML maximum-likelihood tree using the FastTree-2 
maximum-likelihood starting tree had a better maximum-likelihood 
score than any of the ExaML maximum-likelihood trees using maximum- 
parsimony starting trees. Thus, for all of the supermatrix analyses, we 
used FastTree-2 to generate our initial starting tree. Support was inferred 
for the branches of the final tree from 100 bootstrap replicates. 

Outgroup taxa from outside Archaeplastida were used to root all spe- 
cies trees estimated using nuclear genes (all ASTRAL and supermatrix 
analyses). The plastome supermatrix tree for Viridiplantae was rooted 
using Rhodophyta as outgroup. 


Inferring and placing WGDs 

DupPipe analyses of WGDs from transcriptomes of single species. 
For each transcriptome, we used the DupPipe pipeline to construct gene 
families and estimate the age distribution of gene duplications’*”. We 
translated DNA sequences and identified reading frames by comparing 
the Genewise” alignment to the best-hit protein from a collection 
of proteins from 25 plant genomes from Phytozome™. For all DupPipe 
runs, we used protein-guided DNA alignments to align our nucleic acid 
sequences while maintaining the reading frame. We estimated synonymous 
divergence (Ks) using PAML with the F3X4 model” for each node in 
the gene-family phylogenies. We identified peaks of gene duplication as 
evidence of ancient WGDs in histograms of the age distribution of gene 
duplications (Ks plots). We identified species with potential WGDs by 
comparing their paralogue age distribution to a simulated null using 
a Kolmogorov-Smirnov goodness of fit test. We then used mixture 
modelling and manual curation to identify significant peaks consistent 
with a potential WGD and to estimate their median paralogue Ks values. 
Significant peaks were identified using a likelihood ratio test in the 
boot.comp function of the package mixtools in R®. 
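
The comparison of an observed paralogue age distribution with a simulated null rests on the Kolmogorov-Smirnov statistic, the largest gap between two empirical cumulative distributions. The Python sketch below computes that statistic for two samples. The function name and the toy Ks values are hypothetical, and no P value is derived here.

```python
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic D (maximum ECDF difference)."""
    a, b = sorted(sample_a), sorted(sample_b)
    d = 0.0
    for v in sorted(set(a) | set(b)):
        # Empirical CDFs evaluated at v.
        fa = bisect_right(a, v) / len(a)
        fb = bisect_right(b, v) / len(b)
        d = max(d, abs(fa - fb))
    return d


# Toy Ks values: observed paralogues versus a simulated null.
observed_ks = [1, 2, 3, 4]
null_ks = [3, 4, 5, 6]
print(ks_statistic(observed_ks, null_ks))
```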


Estimating orthologous divergence. To place putative WGDs in rela- 
tion to lineage divergence, we estimated the synonymous divergence 
of orthologues among pairs of species that may share a WGD based on 
their phylogenetic position and evidence from the within-species Ks 
plots. We used the RBH Orthologue pipeline” to estimate the mean and 
median synonymous divergence of orthologues and compared those 
to the synonymous divergence of inferred paleopolyploid peaks. We 
identified orthologues as reciprocal best blast hits in pairs of transcrip- 
tomes. Using protein-guided DNA alignments, we estimated the pairwise 
synonymous divergence for each pair of orthologues using PAML with 
the F3X4 model”. WGDs were interpreted to have occurred after lineage 
divergence if the median synonymous divergence of WGD paralogues 
was younger than the median synonymous divergence of orthologues. 
Similarly, if the synonymous divergence of WGD paralogues was older 
than the orthologue synonymous divergence, then we interpreted 
those WGDs as shared. 
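The decision rule described above can be sketched as a comparison of two medians. This is an illustrative sketch only: the function name `place_wgd`, the return labels and the handling of exactly equal medians (reported here as "ambiguous") are assumptions that do not come from the text.

```python
from statistics import median


def place_wgd(paralogue_ks, orthologue_ks):
    """Place a WGD relative to a lineage split using median Ks values.

    A smaller Ks means a younger event, so a WGD whose paralogues have a
    smaller median Ks than the orthologues is placed after the divergence.
    """
    wgd_median = median(paralogue_ks)
    split_median = median(orthologue_ks)
    if wgd_median < split_median:
        return "after divergence"
    if wgd_median > split_median:
        return "shared"
    return "ambiguous"  # equal medians: assumption, not covered by the text


if __name__ == "__main__":
    # Hypothetical Ks values for WGD paralogues and for orthologue pairs.
    print(place_wgd([0.3, 0.4, 0.5], [0.8, 0.9, 1.0]))
    print(place_wgd([1.2, 1.3, 1.4], [0.8, 0.9, 1.0]))
```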


MAPS analyses of WGDs from transcriptomes of multiple species. 
To infer and locate putative WGDs in our datasets, we used a gene-tree 
sorting and counting algorithm, the multi-taxon paleopolyploidy search 
(MAPS) tool. For each MAPS analysis, we selected at least two species
that potentially share a WGD in their ancestry as well as representative 
species from lineages that may phylogenetically bracket the WGD. MAPS 
uses this given species tree to filter collections of nuclear gene trees for 
subtrees consistent with relationships at each node in the species tree. 
Using this filtered set of subtrees, MAPS identifies and records nodes 
with a gene duplication shared by descendant taxa. To infer and locate 
a potential WGD, we compared the number of duplications observed
at each node to a null simulation of background gene birth and death
rates. A Fisher’s exact test, implemented in R, was used to identify
locations with significant increases in gene duplication compared with
a null simulation (Supplementary Table 5). Locations with significantly
more duplications than expected were then compared to a simulated
WGD at this location. If the observed duplications were consistent with
this simulated WGD using Fisher’s exact test, we identified the location as
a WGD if it was consistent with inferences from Ks plots and orthologue
divergence data. In some cases, MAPS inferred significant duplications
without apparent signatures in Ks plots or previously published research.
In these cases, we recognized the event as a significant burst of gene
duplication.
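To make the comparison of observed and simulated duplication counts concrete, the following sketch computes a one-sided Fisher's exact test on a 2x2 table of subtree counts. It is a stand-alone illustration in Python rather than the R implementation used in the study; the function name, the table layout and the choice of a one-sided alternative are assumptions.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Assumed layout: a = observed subtrees with a shared duplication,
    b = observed subtrees without one, c and d = the same counts from the
    null simulation. Returns P(X >= a) under the hypergeometric null.
    """
    row1 = a + b          # all observed subtrees
    col1 = a + c          # all subtrees with a duplication
    total = a + b + c + d
    denominator = comb(total, col1)
    upper = min(row1, col1)
    numerator = sum(
        comb(row1, k) * comb(total - row1, col1 - k)
        for k in range(a, upper + 1)
    )
    return numerator / denominator


if __name__ == "__main__":
    # Hypothetical counts at one node of a species tree.
    print(fisher_exact_greater(60, 140, 20, 180))
```

A small P value at a node indicates more shared duplications than the null simulation produces; the same function could be applied to the comparison with a positive WGD simulation.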

Each MAPS analysis was designed to place focal WGDs near the centre 
of a species tree to minimize errors in WGD inference. Errors in
transcriptome or genome assembly, gene-family clustering and the
construction of gene-family phylogenies can result in topological errors in
gene trees. Previous studies have suggested that errors in gene trees
can lead to biased placements of duplicates towards the root of the
tree and losses towards the tips of the tree. For this reason, we aimed
to put focal nodes for a particular MAPS analysis test in the middle of
the phylogeny. To further decrease potential error in our inferences of
gene duplications, we required at least 45% of the ingroup taxa to be
present in all subtrees analysed by MAPS. If this minimum requirement
of ingroup taxa numbers was not met, the gene subtree was filtered
out and excluded from our analysis. Increasing taxon occupancy leads
to a more accurate inference of duplications and reduces some of the
biases in mapping duplications onto a species tree. To maintain
sufficient gene-tree numbers for each MAPS analysis, we used collections
of gene-family phylogenies for six to eight taxa to infer ancient WGDs. 
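The taxon-occupancy requirement can be expressed as a simple filter. This is a sketch under stated assumptions, not the MAPS code: the function name and the taxon labels are hypothetical, and only the 45% threshold comes from the text.

```python
def passes_occupancy(subtree_taxa, ingroup_taxa, min_fraction=0.45):
    """Return True if enough ingroup taxa are present in a gene subtree.

    subtree_taxa: taxa represented at the tips of the subtree.
    ingroup_taxa: all ingroup taxa of the MAPS species tree.
    """
    ingroup = set(ingroup_taxa)
    if not ingroup:
        raise ValueError("ingroup_taxa must not be empty")
    present = ingroup & set(subtree_taxa)
    return len(present) / len(ingroup) >= min_fraction


if __name__ == "__main__":
    ingroup = ["sp1", "sp2", "sp3", "sp4", "sp5", "sp6"]  # hypothetical taxa
    print(passes_occupancy(["sp1", "sp2", "sp3"], ingroup))  # 3 of 6 present
    print(passes_occupancy(["sp1", "sp2"], ingroup))         # 2 of 6 present
```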

For each MAPS analysis, the transcriptomes were translated into
amino acid sequences using the TransPipe pipeline. Using these
translations, we performed reciprocal protein BLAST (BLASTp) searches
among datasets for the MAPS analysis using a cut-off of E = 1 × 10°. We
clustered gene families from these BLAST results using OrthoFinder
under the default parameters. Using a custom Perl script
(https://bitbucket.org/barkerlab/MAPS), we filtered for gene families
that contained at least one gene copy from each taxon in a given MAPS
analysis and discarded the remaining OrthoFinder clusters. We used PASTA
for automatic alignment and phylogeny reconstruction of gene families.
For each gene-family phylogeny, we ran PASTA until we reached three
iterations without an improvement in likelihood score using a centroid
breaking strategy. Within each iteration of PASTA, we constructed subset
alignments using MAFFT, used Muscle for merging these subset
alignments and RAxML for tree estimation. The parameters for each
software package were the default options for PASTA
(https://bitbucket.org/barkerlab/1kp). We used the best-scoring PASTA
tree for each multi-species nuclear gene family to collectively estimate
the numbers of shared gene duplications on each branch of the given
species tree.
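The gene-family filter described above (at least one gene copy from every taxon) was implemented by the authors as a custom Perl script. The following Python fragment is only a hypothetical illustration of the same rule; the data layout, a mapping from cluster name to a list of (taxon, gene) pairs, is an assumption.

```python
def families_with_all_taxa(families, taxa):
    """Keep only clusters that contain at least one gene from every taxon.

    families: dict mapping a cluster name to a list of (taxon, gene_id) pairs.
    taxa: the taxa included in one MAPS analysis.
    """
    required = set(taxa)
    return {
        name: genes
        for name, genes in families.items()
        if required <= {taxon for taxon, _gene in genes}
    }


if __name__ == "__main__":
    # Hypothetical clusters for a three-taxon analysis.
    clusters = {
        "OG1": [("spA", "g1"), ("spB", "g2"), ("spC", "g3"), ("spC", "g4")],
        "OG2": [("spA", "g5"), ("spB", "g6")],
    }
    print(sorted(families_with_all_taxa(clusters, ["spA", "spB", "spC"])))
```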

To generate null simulations, we first estimated the mean background
gene duplication rate (λ) and gene loss rate (μ) with WGDgc
(Supplementary Tables 5, 11). Gene count data were obtained from
OrthoFinder clusters associated with each species tree (Supplementary
Table 5). λ and μ were estimated using only gene clusters that spanned
the root of their respective species trees, which has been shown to
reduce biases in the maximum-likelihood estimates of λ and μ. We chose a
maximum gene-family size of 100 for parameter estimation, which was
necessary to provide an upper bound for numerical integration of node
states. We provided a prior probability distribution on the number of
genes at the root of each species tree, such that ancestral gene-family
sizes followed a shifted geometric distribution with mean equal to the
average number of genes per gene family across species (Supplementary
Table 5).
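As an illustration of the prior on ancestral gene-family size, the sketch below evaluates a shifted geometric distribution parameterized by its mean. It assumes the common convention in which the support starts at one gene, so that P(N = k) = p(1 - p)^(k - 1) with p = 1/mean; the text does not spell out the parameterization, and the function name is hypothetical.

```python
def shifted_geometric_pmf(k, mean):
    """P(N = k) for a geometric distribution on k = 1, 2, 3, ...

    The success probability is p = 1 / mean, which gives E[N] = mean.
    """
    if mean < 1:
        raise ValueError("mean must be at least 1 for support starting at 1")
    if k < 1:
        return 0.0
    p = 1.0 / mean
    return p * (1.0 - p) ** (k - 1)


if __name__ == "__main__":
    # Hypothetical average of 2.5 genes per gene family.
    for size in range(1, 6):
        print(size, shifted_geometric_pmf(size, 2.5))
```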

Gene trees were then simulated within each MAPS species tree using
the GuestTreeGen program from GenPhyloData. For each species
tree, we simulated 3,000 gene trees with at least one tip per species:
1,000 gene trees at the λ and μ maximum-likelihood estimates, 1,000
gene trees at half the estimated λ and μ, and 1,000 trees at three times
λ and μ. For all simulations, we applied the same empirical prior used for
estimation of λ and μ. We then randomly resampled 1,000 trees without
replacement from the total pool of gene trees 100 times to provide a
measure of uncertainty on the percentage of subtrees at each node. For
positive simulations of WGDs, we simulated gene trees using the same
approach used to generate null distributions (Supplementary Table 5)
but incorporated a WGD at the test branch. Previous empirical estimates
of paralogues retained following a plant WGD are 10% on average. To
be conservative for inferring WGDs in our MAPS analyses, we allowed
at least 20% of the genes to be retained following the simulated WGD to
account for biased gene retention and loss. For WGDs that might have a
lower gene retention rate, we used an additional simulation using 15%
gene retention (Supplementary Table 6).
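The resampling step (1,000 trees drawn without replacement from the pool of 3,000 simulated trees, repeated 100 times) can be sketched as follows. This is an illustrative sketch, not the authors' scripts: each simulated tree is reduced to a hypothetical Boolean flag that records whether it shows a shared duplication at the node of interest, and the fixed seed is an assumption made for reproducibility.

```python
import random


def resample_percentages(pool, n_draw=1000, n_reps=100, seed=0):
    """Percentage of trees with a duplication in repeated subsamples.

    pool: list of booleans, one per simulated gene tree, True if the tree
          shows a shared duplication at the focal node (hypothetical flag).
    Each replicate draws n_draw trees without replacement from the pool.
    """
    rng = random.Random(seed)
    percentages = []
    for _ in range(n_reps):
        draw = rng.sample(pool, n_draw)
        percentages.append(100.0 * sum(draw) / n_draw)
    return percentages


if __name__ == "__main__":
    # Hypothetical pool: 600 of 3,000 simulated trees carry a duplication.
    pool = [True] * 600 + [False] * 2400
    result = resample_percentages(pool)
    print(min(result), max(result))
```

The spread of the resulting percentages gives the measure of uncertainty against which the observed percentage at a node can be compared.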


Gene-family evolution 

Transcriptome-based gene-family size estimation. To robustly 
estimate gene-family sizes from transcriptomic data, we needed to 
overcome three major challenges: (1) the fragmentation of transcript 
sequences; (2) the absence of low-abundance transcripts; and (3) the 
over-prediction of gene-family sizes due to assembly duplications and 
biological isoforms. We dealt with these challenges as follows. 


Fragmentation of data. The multiple sequence alignments used to 
construct the domain-specific profile hidden Markov models (HMMs) 
ranged from 23 to 463 amino acids in length; 78% of these alignments were 
shorter than 120 amino acids, and 84.6% of the assembled and translated 
transcripts were longer than 120 amino acids. By mainly characterizing 
gene families using single domains (Supplementary Table 9), we limited 
the effect of the fragmentation of transcripts from the assembly of short 
read data. HMMs used for gene-family classification and decision rules
obtained from either published work or gene-family experts are given
in Supplementary Table 9; 12 out of 23 gene families were classified by a
single ‘should’ rule, 2 out of 23 were defined by an XOR ‘should’ rule, which
also classifies a sequence by the presence of a single domain, and 8 out of 23
gene families were classified by a more complex rule set including ‘should
not’ rules. The only gene family for which multiple domains needed to
be present was the PLS subfamily of the PPR gene family.
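The decision rules can be pictured as set operations on the domains detected in a sequence. The sketch below is one hypothetical reading of the rule types named above: 'should' domains must all be present, 'should not' domains must all be absent, and an XOR rule requires exactly one of the listed domains. The function name and the domain labels are placeholders; the actual rules are those in Supplementary Table 9.

```python
def classify(domains, should=(), should_not=(), xor=()):
    """Return True if a sequence satisfies a gene-family rule set.

    domains: names of the domains detected in the sequence.
    should: domains that must all be present.
    should_not: domains that must all be absent.
    xor: if given, exactly one of these domains must be present.
    """
    found = set(domains)
    if not set(should) <= found:
        return False
    if found & set(should_not):
        return False
    if xor and sum(1 for name in xor if name in found) != 1:
        return False
    return True


if __name__ == "__main__":
    # Placeholder domain names, not taken from the study.
    print(classify({"DOMAIN_A"}, should=["DOMAIN_A"]))
    print(classify({"DOMAIN_A", "DOMAIN_X"}, should=["DOMAIN_A"],
                   should_not=["DOMAIN_X"]))
    print(classify({"DOMAIN_B"}, xor=["DOMAIN_B", "DOMAIN_C"]))
```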


Loss of low abundance transcripts. To account for possible bias in 
the sampling of the gene space, all species that showed low levels of 
transcriptome completeness were removed. The lowest value of tran- 
scriptome completeness obtained from 30 annotated plant genomes 
was used as the lower exclusion limit. We removed all samples in which 
more than 42.5% of BUSCO sequences were missing using default
settings and the eukaryotic dataset as the query database. 


Gene-family over-prediction. We clustered assembled protein 
sequences by sequence similarity and merged sequences that showed at 
least 99% identity. To check for the possibility of merging sequences that 
should be counted separately, different identity cut-offs were compared 
between the 1KP datasets and 32 annotated plant genomes. 

Extended Data Figure 3c, d shows the average gene-family sizes for 23 
gene families and 13 clades obtained from 1KP samples and 32 annotated 
plant genomes. These gene-family sizes show a high Pearson correlation 
(r=0.95) between 1KP samples and plant genomes, and therefore a linear 
relationship between the two approaches is indicated. The results from 
the 1KP dataset are on average smaller by a factor of 2.3. Although this
is a clear underestimate, the scale factor by which the estimate is too
small is relatively consistent, especially as the gene-family sizes increase.


Sequence clustering. We used cd-hit v.4.5.7 to reduce the number
of protein sequence duplications in the dataset. We assessed 100%,
99.5%, 99%, 95% and 90% sequence identity thresholds. The percentage
of remaining sequences for the 1KP samples and 32 reference genomes
is displayed in Extended Data Fig. 3f. We chose 99% sequence identity
as the value to use for this study.


Estimation of gene-family size. Gene-family experts provided the 
knowledge to classify protein sequences as members of gene families 
with profile HMMs. In total, 46 HMMs representing 23 large gene families
were used to estimate gene-family sizes in the analysed species.
Classification rules and HMMs for 14 gene families that have been
published previously were converted to HMMER3 format and used in this
study. Gene-family classification rules and HMMs for the remaining nine
families can be found in Supplementary Table 8. HMMs were taken from
the Pfam database (accessed 12 May 2016) or were provided by
gene-family experts (Supplementary Table 8). HMMER (v.3.1b2) was used
to scan for matches in the filtered 1KP dataset. Where available,
gathering thresholds were used; otherwise an E-value cut-off was applied to
indicate domain presence. If the E value is not noted in Supplementary
Table 9, the default E value of 10 was applied. The results on the species
level are listed in Supplementary Table 10s. 


Statistical test for expansions and contractions. To assess whether a 
gene family expanded or contracted in a lineage, we compared a weight- 
ed average of gene numbers in adjacent clades and grades (Fig. 4). We 
also checked for expansions and contractions within clades but did 
not find any statistically significant shifts. The counts of gene-family 
members from two clades or grades were compared with a Kolmogo- 
rov-Smirnov test with a P-value threshold of 1 x 10° in R®°. The tests 
conducted in this study are listed in Supplementary Table 7. Fold changes 
were computed using the trimmed arithmetic mean in which the top and 
bottom 5% of the data were discarded. Only expansions larger than 1.5 
fold (or contractions smaller than 2/3) are reported. 
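The fold-change rule can be sketched with a trimmed arithmetic mean that discards the top and bottom 5% of the values. This is a minimal illustration rather than the authors' R code: the function names, the direction of the ratio (second group over first) and the label for unreported changes are assumptions.

```python
def trimmed_mean(values, trim=0.05):
    """Arithmetic mean after discarding the top and bottom `trim` fraction."""
    data = sorted(values)
    if not data:
        raise ValueError("values must not be empty")
    cut = int(len(data) * trim)  # number of values dropped at each end
    kept = data[cut:len(data) - cut]
    return sum(kept) / len(kept)


def fold_change_call(counts_a, counts_b, trim=0.05):
    """Fold change of group B relative to group A, with the reporting rule.

    Only expansions above 1.5-fold or contractions below 2/3 are reported.
    Assumes the trimmed mean of counts_a is not zero.
    """
    fold = trimmed_mean(counts_b, trim) / trimmed_mean(counts_a, trim)
    if fold > 1.5:
        return fold, "expansion"
    if fold < 2 / 3:
        return fold, "contraction"
    return fold, "not reported"


if __name__ == "__main__":
    # Hypothetical gene-family counts in two adjacent clades.
    clade_a = [2] * 18 + [0, 100]
    clade_b = [4] * 20
    print(fold_change_call(clade_a, clade_b))
```

In the study this rule was applied alongside the Kolmogorov-Smirnov test described above; the sketch covers only the fold-change part.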


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All raw sequence reads have been posted in the NCBI SRA database under
BioProject accession PRJEB4922. SRA entries for each assembly are
listed in Supplementary Table 1. All sequence, gene tree and species
tree data can be accessed through CyVerse Data Commons at
https://doi.org/10.25739/8m7t-4e85. In addition, gene-family nucleotide and
amino acid FASTA files can also be found at
http://jlmwiki.plantbio.uga.edu/onekp/v2/; multiple sequence alignments,
gene trees and species trees for single-copy nuclear genes included in
phylogenomic analyses are also at https://doi.org/10.5281/zenodo.3255100;
Ks plots, alignments and trees used for WGD analyses can be found at
https://bitbucket.org/barkerlab/1kp; and data used for gene-family
expansion analyses can be found at
https://github.com/GrosseLab/OneKP-gene-family-evo.


Code availability 


Scripts used for phylogenomic species tree analyses are available at
https://doi.org/10.5281/zenodo.3255100. Scripts used for MAPS
analyses of WGDs are available at https://bitbucket.org/barkerlab/maps
and scripts used for gene-family expansion analyses are available at
https://github.com/GrosseLab/OneKP-gene-family-evo. All script files
are also accessible through CyVerse Data Commons at
https://doi.org/10.25739/8m7t-4e85.
Extended Data Fig. 1 | Mean number of MADS-box genes in the individual transcriptomes. Sample sizes (n) are as follows: liverworts, n = 26;
Extended Data Fig. 2 | RAxML phylogeny of classic type II MIKCc MADS-box
genes of liverworts, mosses, lycophytes, monilophytes (ferns) and
spermatophytes (seed plants). CgMADS1 from Chara globularis was used as a
representative of the outgroup. Branches leading to genes from the different
phyla are coloured according to the simplified phylogeny of land plants that is
shown in the top left corner. The phylogenetic positions of some known type II
MIKCc MADS-box genes representative of previously described clades of
MADS-box genes are indicated on the right together with the species and
phylum in which these genes have been identified. The four clades of MIKC*
MADS-box genes that trace back to the most recent common ancestor of
Euphyllophytes are shaded in grey.


Extended Data Fig. 3 | Assessments of transcriptome assembly gene-family
representation relative to gene-family members identified in annotated
genomes. a, BUSCO versus CEGMA (CEG) gene occupancy for each sample.
BUSCO transcriptome completeness is given as ‘complete plus fragmented’
BUSCO percentage using the eukaryota_odb9 database. CEGMA transcriptome
completeness is given as conditional reciprocal best BLAST hits
(see Supplementary Methods). Dotted line represents 57.5% (BUSCO) and 70%
(CEGMA) gene occupancy threshold. Black dots represent 1KP samples
(n = 1,020) and blue dots annotated plant genomes (n = 30). b, BUSCO gene
occupancy for each major clade. Boxes represent lower and upper quartiles; the
black bold line represents the median and whiskers extend to the most-extreme
data points. Sample sizes: Chromista, n = 23; Rhodophyta, n = 18; Glaucophyta,
n = 2; Chlorophyta, n = 94; streptophyte algae, n = 42; hornworts, n = 7;
liverworts, n = 18; mosses, n = 38; lycophytes, n = 16; monilophytes, n = 59;
gymnosperms, n = 76; ANA grade, n = 6; monocots, n = 96; Chloranthales, n = 1;
magnoliids, n = 22; CRPT grade, n = 29; asterids, n = 205; Caryophyllales, n = 48;
rosids, n = 176; Saxifragales, n = 23; Santalales, n = 6. Dotted line represents 57.5%
(BUSCO) gene occupancy threshold. c, Scatterplot of gene-family sizes in
transcriptomes versus genomes on a logarithmic scale. The grey line indicates
x = y, the black line indicates a linear regression fitted to the data (n = 299; 23
gene families in 13 species groups). Pearson and Spearman correlation
coefficients (n = 299) are indicated. d, Box plot of transcriptome:genome ratios
of gene-family sizes for each species group. Boxes indicate upper and lower
quartiles with median; whiskers extend to data points no more than 1.5x the
interquartile range (n = 23) with outliers plotted as individual data points. e, f,
Number of remaining sequences after filtering with cd-hit and a threshold of
100%, 99.9%, 99%, 95% or 90% in transcriptome sequences and reference
genomes (Supplementary Table 8). Boxes indicate upper and lower quartiles
with median; whiskers extend to data points no more than 1.5x the interquartile
range (e, n = 1,451; f, n = 32) with outliers plotted as individual data points.
Policy information about availability of computer code


Data collection  Transcripts were assembled using the SOAPdenovo-Trans assembler (version of 2012-04-05); NCBI BLAST, TransRate, CEGMA6 and
BUSCO were used to assess assembly quality; translations were performed using TransPipe and Genewise 2.2.2.

Data analysis  Gene and species tree estimates used RAxML v. 8.1.17, FastTree-2 v. 2.1.5 and ExaML v. 3.0.14; ASTRAL-II v. 5.0.3 was used to estimate species trees; scripts for
post-processing of trees (DiscoVista) - https://github.com/smirarab/1kp ; genome duplications were investigated using the DupPipe, PAML
and MAPS pipelines, including the GuestTreeGen program within GenPhyloData - https://bitbucket.org/barkerlab/maps ; analysis
of gene family expansions included HMMER v3.1b2 and scripts available at https://github.com/GrosseLab/OneKP-gene-family-evo


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers 
upon request. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 




Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


All data are public: Raw reads in NCBI SRA database - http://www.onekp.com/public_read_data.html ; Assembled transcripts and transcript translations -
http://www.onekp.com/public_data.html ; Gene family nucleotide and amino acid fasta files - http://jlmwiki.plantbio.uga.edu/onekp/v2/ ; Multiple sequence
alignments, gene trees and species trees for single copy nuclear genes - https://github.com/smirarab/1kp
Study description  Gene and species phylogenies were estimated in order to infer: relationships across the green tree of life (Viridiplantae), the timing
of genome-scale duplication events, and the timing of gene family expansions.

Research sample  RNA was isolated from young vegetative tissue from 1342 samples representing 1147 species across all major subclades of
Viridiplantae, glaucophytes (Glaucophyta) and red algae (Rhodophyta) and used to generate RNA seq reads and assemblies.

Sampling strategy  Samples were collected as available in living collections. Species were chosen for RNA seq with a priority to maximize taxonomic
diversity across Viridiplantae and outgroups.

Data collection  RNA samples were derived from vouchered material in living collections as described in Table 1.

Timing and spatial scale  Samples were collected as available. No attempt was made to control for environmental variation.

Data exclusions  RNA samples exhibiting evidence of contamination were excluded from phylogenetic analyses. Contamination was diagnosed
through BLAST comparisons to ribosomal RNA and plastid gene databases.

Reproducibility  Bootstrap analyses and Bayesian posterior probabilities were estimated for all nodes in gene trees and species trees.

Randomization  Bootstrap support for nodes in gene trees and species trees was estimated in a standard fashion through random resampling of
columns in sequence alignments.

Blinding  No blinding was done for any of our analyses.
Did the study involve field work? Yes No 


Reporting for specific materials, systems and methods 
Dietary habits and vascular risk factors promote both Alzheimer’s disease and
cognitive impairment caused by vascular factors. Furthermore, accumulation of
hyperphosphorylated tau, a microtubule-associated protein and a hallmark of
Alzheimer’s pathology, is also linked to vascular cognitive impairment. In mice, a
salt-rich diet leads to cognitive dysfunction associated with a nitric oxide deficit in
cerebral endothelial cells and cerebral hypoperfusion. Here we report that dietary salt
induces hyperphosphorylation of tau followed by cognitive dysfunction in mice, and
that these effects are prevented by restoring endothelial nitric oxide production. The
nitric oxide deficiency reduces neuronal calpain nitrosylation and results in enzyme
activation, which, in turn, leads to tau phosphorylation by activating cyclin-dependent 
kinase 5. Salt-induced cognitive impairment is not observed in tau-null mice or in mice 
treated with anti-tau antibodies, despite persistent cerebral hypoperfusion and 
neurovascular dysfunction. These findings identify a causal link between dietary salt, 
endothelial dysfunction and tau pathology, independent of haemodynamic 
insufficiency. Avoidance of excessive salt intake and maintenance of vascular health 
may help to stave off the vascular and neurodegenerative pathologies that underlie 
dementia in the elderly. 


Vascular risk factors, including excessive salt consumption, have long
been associated with cerebrovascular diseases and cognitive impairment.
A diet rich in salt is an independent risk factor for stroke and
dementia and has been linked to the cerebral small vessel disease
that underlies vascular cognitive impairment, a condition that is
associated with endothelial dysfunction and reduced cerebral blood flow
(CBF). In mice, a high-salt diet (HSD) induces cognitive dysfunction by
targeting the cerebral microvasculature through a gut-initiated adaptive
immune response mediated by TH17 lymphocytes. The resulting
increase in circulating IL-17 leads to inhibition of endothelial nitric oxide
synthase (eNOS) and reduced vascular production of nitric oxide, which
in turn impairs endothelial vasoactivity and lowers CBF by about 25%.
However, it remains unclear how hypoperfusion, resulting from an HSD
or other vascular risk factors, leads to impaired cognition. The prevailing
view is that hypoperfusion compromises the delivery of oxygen and
glucose to energy-demanding brain regions that are involved in cognition.
However, the relatively small reduction in CBF that is associated
with an HSD in mice and vascular cognitive impairment in humans
may not be sufficient to impair cognitive function, suggesting that
vascular factors beyond cerebral perfusion are involved.

Excessive phosphorylation of the microtubule-associated protein 
tau promotes the formation of insoluble tau aggregates, which are 
thought to mediate neuronal dysfunction and cognitive impairment 
in Alzheimer’s disease and other tauopathies. However, accumulation
of tau has also been detected in cerebrovascular pathologies
associated with endothelial dysfunction and cognitive impairment.


Therefore, we investigated whether tau accumulation rather than 
cerebral hypoperfusion contributes to the cognitive dysfunction 
induced by an HSD. First, we investigated whether an HSD induces 
phosphorylation of tau. Male C57BL/6 mice were fed a normal diet or
an HSD (4 or 8% NaCl, a commonly used model of excessive dietary
salt corresponding to an 8-16-fold increase in salt content compared
to regular mouse chow). Phosphorylation of tau epitopes that promote
aggregation of tau and neuronal dysfunction was assessed
over time by western blotting. An HSD (8% NaCl) induced a sustained
increase in phosphorylated tau (p-tau; detected using AT8 (pSer202
and pThr205) and RZ3 (pThr231) antibodies) in the neocortex and
hippocampus without increasing total tau (detected using Tau 46;
Fig. 1a). In the hippocampus, there was also an increase in tau
phosphorylation measured using PHF13 and pSer199Ser202 antibodies
(Extended Data Fig. 1a). AT8 tau phosphorylation was abolished by
λ-protein phosphatase (Extended Data Fig. 1b). AT8 and RZ3
immunoreactivity were also increased in the neocortex of female mice fed
an HSD (Extended Data Fig. 1c). The HSD did not increase acetylation
of tau at K280, a post-translational modification that has been
implicated in tau pathology (Extended Data Fig. 1a). AT8 and MC1
immunoreactivity was detected in the piriform cortex, but we found 
no neurofibrillary tangles (Fig. 1b, Extended Data Fig. 1d, e). We found 
neither neuronal or white-matter damage, nor significant changes 
in astrocytes, microglia/macrophages or pericytes (Extended Data 
Fig. 1 | HSD increases tau phosphorylation and insoluble tau. a, HSD
increases AT8 and RZ3 immunoreactivity (cortex: AT8, normal diet (ND)/HSD
n = 8/9, *P < 0.0001; RZ3, ND/HSD n = 12/11, *P < 0.0001; hippocampus (Hipp):
AT8, ND/HSD n = 9/9, *P < 0.0001; RZ3, ND/HSD n = 9/9, *P = 0.0011; two-tailed
unpaired t-test for HSD versus ND). In all figures and legends, asterisks denote
significant differences. b, HSD increases neuronal AT8 immunoreactivity in the
piriform cortex. Right, magnified view of boxes on left (scale bars, 500 μm (left);
100 μm (right)). Representative images (n = 5 mice per group). c, Time course of
neocortical increase in AT8 and RZ3. AT8, 4 weeks: ND/HSD n = 4/5, *P = 0.0116;
8 weeks: ND/HSD n = 9/8, *P = 0.0066; 24 weeks: ND/HSD n = 8/9, *P = 0.0152; 36
weeks: ND/HSD n = 4/5, *P = 0.0087; RZ3, 4 weeks: ND/HSD n = 4/5, *P = 0.0097;
8 weeks: ND/HSD n = 7/8, *P = 0.0084; 24 weeks: ND/HSD n = 8/9, *P = 0.0135;
36 weeks: ND/HSD n = 4/5, *P = 0.0204; two-tailed unpaired t-test for HSD versus
ND. d, HSD induces deficits in recognition memory. Diet: *P < 0.0001, time:
*P = 0.0002; 8 weeks: ND/HSD n = 8/11; 12 weeks: ND/HSD n = 16/12; 24 weeks:
ND/HSD n = 14/13 mice per group, two-way ANOVA and Tukey’s test. e, HSD
induces deficits in spatial memory. Diet: *P = 0.0048, time: *P < 0.0001; ND/HSD
n = 13/12, two-way repeated measures ANOVA and Bonferroni's test; primary


Fig. 2a-c). Increased AT8 immunoreactivity was also observed in the 
neocortex of mice fed with the 4% HSD (Extended Data Fig. 1f). 
In the neocortex, AT8 immunoreactivity increased after 4 weeks and
RZ3 immunoreactivity after 8 weeks of HSD, and both remained elevated
for up to 36 weeks, whereas in the hippocampus AT8 immunoreactivity
peaked at 12 and 36 weeks (Fig. 1c, Extended Data Fig. 1g). Starting after
12 weeks of HSD, mice exhibited difficulties in recognizing novel objects 


latency day 5, ND/HSD n = 13/12, *P = 0.0031 versus ND, two-tailed unpaired
t-test. f, Neocortical and hippocampal levels of AT8 correlate with spatial
learning impairment. AT8 cortex: Barnes maze r = 0.4491, *P < 0.0001, n = 84;
novel object recognition (NOR) r = −0.2621, *P = 0.0188, n = 80; AT8
hippocampus: Barnes maze r = 0.2073, *P = 0.0462, n = 93; NOR r = −0.2915,
*P = 0.0053, n = 90; Pearson’s correlation coefficient. g, HSD increases levels of
insoluble tau (western blotting) extracted in RIPA and FA fractions after
12 weeks. Cortex: RIPA, ND/HSD n = 7/10, *P = 0.0032; FA, ND/HSD n = 8/7,
*P = 0.0146; hippocampus: RIPA, ND/HSD n = 7/9, *P = 0.0418; FA, ND/HSD n = 7/9,
*P = 0.0494, two-tailed unpaired t-test for HSD versus ND. h, HSD increases levels
of insoluble tau (electrochemiluminescence). Cortex: RIPA, ND/HSD n = 11,
*P = 0.0050; FA, ND/HSD n = 14/11, *P = 0.0028; hippocampus: RIPA, ND/HSD
n = 6/8, *P = 0.0380; FA, ND/HSD n = 7/8, *P = 0.0037; two-tailed unpaired t-test
for HSD versus ND. i, HSD shifts tau from the RAB fraction to the less soluble
RIPA and FA fractions. Cortex: ND/HSD n = 9/8, RAB, P = 0.4234, RIPA, P = 0.5414,
FA, *P = 0.0325; hippocampus: ND/HSD n = 5/6, RAB, P = 0.2468, RIPA, P = 0.3290,
FA, *P = 0.0152; two-tailed unpaired t-test for HSD versus ND. For gel source data
see Supplementary Fig. 1. Data are expressed as mean ± s.e.m.


and developed a deficit in spatial memory for the Barnes maze (Fig. 1d,
e, Extended Data Fig. 3a). Female mice fed an HSD also showed cognitive
dysfunction (Extended Data Fig. 1c). The magnitude of phosphorylation
at the AT8 and RZ3 epitopes was correlated with performance on
the Barnes maze (Fig. 1f) and novel object recognition (Extended Data
Fig. 3b). The HSD did not increase levels of amyloid-β (Aβ38, Aβ40 or Aβ42)
in the neocortex (Extended Data Fig. 3c). p-tau was also increased in
Fig. 2 | The NO precursor L-arginine prevents the increase in p-tau induced by
HSD. a-c, Administration of L-arginine (10 gin drinking water), starting at 
week 8 of HSD and continued through week 12, suppresses AT8 accumulation in
neocortex (a) and hippocampus (b); quantified in c. Cortex: vehicle (Veh), ND/HSD
n=10/10, L-arginine (L-arg), ND/HSD n=16/21, *P=0.0045; hippocampus: Veh, 
ND/HSD n=10/8, *P=0.0067, L-arg, ND/HSD n= 7/12, two-tailed unpaired t-test 
for HSD versus ND. d, e, L-Arginine treatment reduces the cognitive deficits 
induced by HSD in both the NOR test (Veh: ND/HSD n = 12/10; L-arg: ND/HSD
n= 6/11; diet: *P= 0.0156, treatment: *P= 0.0406; two-way ANOVA and Tukey’s 
test) and the Barnes maze (primary latency, diet: *P= 0.0182, time: *P< 0.0001, 
two-way repeated measures ANOVA and Tukey’s test; primary latency day 5, 
*P=0.0439; Kruskal-Wallis test). For gel source data see Supplementary Fig. 1. 
Data are expressed as mean ± s.e.m.
Fig. 3 | HSD induces activation of calpain and CDK5 associated with calpain
denitrosylation. a, HSD did not alter expression of calpain 1 or 2 (ND/HSD,
n = 10), but increased enzyme activity. ND/HSD n = 9/9, *P = 0.0404 versus ND,
two-tailed unpaired t-test. b, HSD increases the cleavage of p35 into p25.
ND/HSD n = 8/8, *P = 0.0426 versus ND, two-tailed unpaired t-test. c, HSD
increases the level of CDK5 bound to p35/p25 (ND/HSD n = 10/10, *P = 0.0347) and
CDK5 activity (ND/HSD n = 10/10, *P = 0.0274; two-tailed unpaired t-test for HSD
versus ND). d, The CDK5 peptide inhibitor TFP5 counteracts the HSD-induced
increase in AT8 and RZ3 immunoreactivity. AT8, cortex: ND/HSD scrambled
n = 10/9, ND/HSD TFP5 n = 9/9, diet: *P < 0.0001, treatment: *P = 0.0164; AT8,
hippocampus: ND/HSD scrambled n = 11/10, ND/HSD TFP5 n = 10/7, diet:
*P = 0.0004, treatment: *P = 0.0360; RZ3, cortex: ND/HSD scrambled n = 10/10,
ND/HSD TFP5 n = 10/11, diet: *P < 0.0001, treatment: *P = 0.0814; RZ3,
hippocampus: ND/HSD scrambled n = 12/11, ND/HSD TFP5 n = 10/12, diet:
*P < 0.0001, treatment: *P = 0.0066; two-way ANOVA and Tukey’s test. e, TFP5
rescues the spatial memory deficits induced by HSD. Primary latency, diet:
P = 0.6415, time: *P < 0.0001, two-way repeated measures ANOVA and Tukey’s
test; primary latency day 5, diet: *P = 0.0016, treatment: P = 0.5797; two-way
ANOVA and Tukey’s test. TFP5 also improves cognitive performance of HSD-fed
mice on the NOR test. Diet: *P = 0.0383, treatment: P = 0.1488; two-way ANOVA
and Tukey’s test. f, L-Arginine counteracts the increase in calpain (ND/HSD
n = 8/10, *P = 0.0335 versus ND, two-tailed unpaired t-test) and CDK5 activity
induced by HSD and reduces CDK5 bound to p35/p25 (ND/HSD n = 7/9, *P = 0.0137
versus ND, two-tailed unpaired t-test). g, Calpain 2 nitrosylation is reduced by
HSD (ND/HSD n = 9/9, diet: *P = 0.0189; ascorbate: *P < 0.0001; two-way ANOVA
and Tukey’s test) and this effect is reversed by L-arginine. ND/HSD n = 6/6, diet:
P = 0.9487, ascorbate: *P < 0.0001, two-way ANOVA and Tukey’s test.
h, Nitrosylation is suppressed in eNOS−/− (ND/HSD n = 6/6, genotype: *P = 0.0223,
ascorbate: *P = 0.0021, two-way ANOVA and Tukey’s test), but not in nNOS−/− mice
(ND/HSD n = 4/4, genotype: P = 0.0843, ascorbate: P < 0.0001; two-way ANOVA
and Tukey’s test). For gel source data see Supplementary Fig. 1. Data are
expressed as mean+s.e.m. 
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Fig. 4|HSD-induced cognitive dysfunction is not observed intau mice and 
is prevented by tau antibodies despite cerebrovascular insufficiency. a, ATS 
and Tau46 are absent intau” mice in RIPA and heat-stable (HS) RIPA fractions. 
Representative blots fromn=5ND-fed tau” mice. b, c, HSD does not alter 
cognition in tau” mice on the NOR test (wild-type (WT): ND/HSD n= 9/10; tau”: 
ND/HSD n=7/9; diet: *P= 0.0055, genotype: P= 0.1827, two-way ANOVA and 
Tukey’s test) or the Barnes maze (ND/HSD n= 9/10, diet: P= 0.9348, time: 

*P< 0001, two-way ANOVA and Tukey’s test). d, The increase in CBF produced by 
neocortical application of acetylcholine is reduced in tau” mice (ND/HSD wild- 
type, n=6/6, tau’, n=9/9; diet: *P< 0.0001, genotype: P= 0.7920, two-way 
ANOVA and Tukey’s test). e, Anti-tau antibodies (HJ8.8, 50 mg per kg per week, 
intraperitoneal injection) do not prevent the reduction in resting CBF induced 
by HSD (IgG: ND/HSD n=11/9; HJ8.8: ND/HSD n= 6/5; diet: *P= 0.0061, 


mice with arterial hypertension induced by angiotensin-II (Extended 
Data Fig. 3d, e) or in a model of Aβ accumulation (Tg2576 mice; Extended 
Data Fig. 3f), both of which are associated with endothelial dysfunction 
and cognitive impairment¹⁹,²⁰. 

To determine whether an HSD alters the solubility of tau—a key deter- 
minant of its harmful effects’°—we measured tau levels in brain tissue 
after sequential biochemical extraction in RAB (salt buffer), RIPA (deter- 
gent buffer) or 70% formic acid (FA), which contain, respectively, soluble, 
less soluble and highly insoluble tau. In samples taken from mice after 
12 weeks of the HSD, tau in the RIPA and FA fractions was increased over 
that in samples from mice fed a normal diet, reflecting an increase in 
insoluble tau (Fig. 1g-i). Hypothermia, which does not cause cognitive 
impairment, also increased p-tau”, but, unlike the HSD, did not lead to 
an increase in insoluble species (Extended Data Fig. 3g, h). These obser- 
vations indicate that an HSD not only promotes hyperphosphorylation 
of tau, but also its aggregation. 

As the nitric oxide precursor L-arginine counteracts the endothelial 
nitric oxide deficit in mice fed an HSD⁷, mice were given L-arginine 
in their drinking water (10 g l⁻¹) during the last 4 weeks of the 12-week 
HSD course. L-Arginine suppressed accumulation of p-tau and prevented 
HSD-induced cognitive dysfunction (Fig. 2a–e, Extended Data 
Fig. 4a), without affecting the HSD-induced increase in circulating IL-17 

treatment: P = 0.9367, two-way ANOVA and Tukey’s test). f, HJ8.8 does not rescue 
the CBF response to acetylcholine (IgG: ND/HSD n = 5/5; HJ8.8: ND/HSD n = 5/5; 
diet: *P = 0.0005, treatment: P = 0.8516, two-way ANOVA and Tukey’s test). 
g, HJ8.8 ameliorates the cognitive dysfunction induced by HSD on both the 
NOR test (IgG: ND/HSD n = 15/13; HJ8.8: ND/HSD n = 13/15; diet: *P = 0.0001, 
treatment: *P = 0.0054, two-way ANOVA and Tukey’s test) and the Barnes maze 
test (primary latency: IgG: ND/HSD n = 19/15; HJ8.8: ND/HSD n = 13/14; time: 
*P < 0.0001, diet: *P = 0.0358, two-way repeated measures ANOVA and Tukey’s 
test; primary latency day 5: IgG: ND/HSD n = 19/16; HJ8.8: ND/HSD n = 13/14; 
*P = 0.0202, Kruskal–Wallis and Dunn’s test). Images on left show representative 
tracks. For gel source data see Supplementary Fig. 1. Data are expressed as 
mean ± s.e.m. 


(Extended Data Fig. 4b). Consistent with a key role of endothelial nitric 
oxide deficiency in accumulation of p-tau, eNOS-null (Nos3−/−) mice fed 
a normal diet showed elevated p-tau that was not increased further by 
an HSD (Extended Data Fig. 4c, d). 

Cyclin-dependent kinase 5 (CDK5), a kinase that is responsible for 
tau hyperphosphorylation”, is tightly regulated by its binding partner 
p35”. Calpains cleave p35 bound to CDK5 into p25, which results in 
dysregulated activation of CDK5 and hyperphosphorylation of tau”. 
As reduced endothelial nitric oxide may lead to tau phosphorylation by 
activating CDK5 via p25²⁴, we investigated whether the HSD influenced 
the activity of calpain and CDK5. Calpain 2 is more abundant than 
calpain 1 in the neocortex (Fig. 3a) and is colocalized with CDK5 in neurons 
(Extended Data Fig. 4e, f). Mice fed an HSD showed increased calpain 
activity compared to mice fed a normal diet, and this led to an increase 
in the p25/p35 ratio and activation of CDK5 (Fig. 3a–c). Other CDK5 
substrates besides tau, such as DARPP-32²⁵, were not phosphorylated 
(Extended Data Fig. 4g). Administration of the CDK5 peptide inhibitor 
TFP5 (40 mg kg⁻¹ twice per week; intraperitoneal)²⁶ attenuated 
phosphorylation of tau and prevented cognitive dysfunction (Fig. 3d, e), without 
blunting the HSD-induced increase in circulating IL-17 (Extended Data 
Fig. 4h). L-Arginine prevented the HSD-induced activation of calpain and 
CDK5 (Fig. 3f) but did not alter calpain levels (Extended Data Fig. 4). 
The HSD did not increase the activity of GSK3β, an enzyme that has been 
implicated in tau phosphorylation”, or the expression of the prolyl 
cis/trans isomerase PIN-1, which regulates tau dephosphorylation²⁷ 
(Extended Data Fig. 5a, b). 

When calpain has been activated by Ca²⁺ it is regulated mainly by its 
endogenous inhibitor calpastatin and by nitrosylation by nitric oxide’’, 
which suppresses its activity”’. As the HSD did not reduce calpastatin 
expression (Extended Data Fig. 5c), we used the biotin switch assay to 
investigate the effect of an HSD on calpain nitrosylation. Calpain nitros- 
ylation was reduced in mice fed an HSD compared with mice fed a normal 
diet, and this effect was reversed by L-arginine (Fig. 3g). Nitrosylation was 
markedly suppressed in eNOS-null mice, but not in nNOS-null (Nos1−/−) 
mice, attesting to the key role of eNOS-derived nitric oxide in regulating 
nitrosylation and activity of calpain (Fig. 3h). As the HSD reduced CDK5 
nitrosylation (Extended Data Fig. 5d), it is unlikely that this modification, 
which activates the enzyme³⁰, is involved in the effects of the HSD. 

Finally, we used tau-null (Mapt−/−) mice and anti-tau antibodies to 
examine the relative contributions of p-tau and neurovascular dysfunc- 
tion to the cognitive deficits induced by an HSD. Tau-null mice that 
were fed an HSD for 12 weeks did not exhibit cognitive impairment, 
but still showed a marked attenuation of the increase in CBF evoked by 
neocortical application of acetylcholine (Fig. 4a–d), a response mediated 
by eNOS-derived nitric oxide*. Similarly, wild-type mice treated 
with anti-tau antibodies (HJ8.8)³² or control IgG (50 mg per kg per week; 
intraperitoneal) for the last 4 weeks of the 12-week HSD regimen showed 
improved cognitive function (Fig. 4g) compared with untreated mice, 
despite having reduced resting CBF and an attenuated CBF response to 
acetylcholine (Fig. 4e, f). HJ8.8 lowered p-tau in the hippocampus (meas- 
ured by AT8 immunoreactivity; Extended Data Fig. 5e) but did not blunt 
the HSD-induced increase in circulating IL-17 (Extended Data Fig. 5f). 
The HSD did not affect the increase in CBF induced by neural activity in 
either tau-null mice or mice treated with HJ8.8 (Extended Data Fig. 5g). 

These observations indicate that the cognitive dysfunction associated 
with HSD is mediated by a deficit in endothelial nitric oxide that results 
from denitrosylation of calpain. This deficit leads to activation of CDK5 
and phosphorylation of tau in neurons (Extended Data Fig. 5h). Nota- 
bly, the hypoperfusion and neurovascular dysfunction that also result 
from the endothelial nitric oxide deficit do not mediate the cognitive 
impairment. Rather, other aspects of endothelial function are involved— 
namely, endothelial nitric oxide maintaining calpain homeostasis and 
preventing CDK5 dysregulation and tau hyperphosphorylation. 

Although the HSD used in mice, in which the salt content is 8-16-fold 
higher than in normal mouse chow””, may exceed the highest reported 
levels of human salt consumption (12.5-20 g per day or 3-5 times the 
recommended level of 4-5 g per day)³³, our data provide a previously 
unrecognized link between dietary habits, vascular dysfunction and tau 
pathology, independent of cerebral hypoperfusion. Such relationships 
may play a role in the frequent coexistence of vascular and neurodegenerative 
pathologies in conditions that cause dementia, such as Alzheimer’s 
disease and frontotemporal dementia”. Whereas the avoidance of 
excessive salt consumption may help to prevent tau pathology, thera- 
peutic efforts to counteract cerebrovascular dysfunction need to go 
beyond rescuing cerebral perfusion and target vascular mediators that 
govern the neurovascular interactions that are essential for cognitive 
health. 


Online content 


Any methods, additional references, Nature Research reporting summaries, 
source data, extended data, supplementary information, acknowledgements, 
peer review information; details of author contributions 
and competing interests; and statements of data and code availability 
are available at https://doi.org/10.1038/s41586-019-1688-z. 


690 | Nature | Vol 574 | 31 October 2019 


1. Scarmeas, N., Anastasiou, C. A. & Yannakoulia, M. Nutrition and prevention of cognitive 
impairment. Lancet Neurol. 17, 1006-1015 (2018). 

2. van de Rest, O., Berendsen, A. A., Haveman-Nies, A. & de Groot, L. C. Dietary patterns, 
cognitive decline, and dementia: a systematic review. Adv. Nutr. 6, 154-168 (2015). 

3. Kendig, M. D. & Morris, M. J. Reviewing the effects of dietary salt on cognition: 
mechanisms and future directions. Asia Pac. J. Clin. Nutr. 28, 6-14 (2019). 

4. De Strooper, B. & Karran, E. The cellular phase of Alzheimer’s disease. Cell 164, 603-615 
(2016). 

5. Nation, D. A. et al. Pulse pressure in relation to tau-mediated neurodegeneration, cerebral 
amyloidosis, and progression to dementia in very old adults. JAMA Neurol. 72, 546-553 
(2015). 

6. Kim, H. J. et al. Assessment of extent and role of tau in subcortical vascular cognitive 
impairment using 18F-AV1451 positron emission tomography imaging. JAMA Neurol. 75, 
999-1007 (2018). 

7. Faraco, G. et al. Dietary salt promotes neurovascular and cognitive dysfunction through a 
gut-initiated TH17 response. Nat. Neurosci. 21, 240-249 (2018). 

8. Fiocco, A. J. et al. Sodium intake and physical activity impact cognitive maintenance in 
older adults: the NuAge Study. Neurobiol. Aging 33, 829.e21-829.e28 (2012). 

9. Gardener, H., Rundek, T., Wright, C. B., Elkind, M. S. & Sacco, R. L. Dietary sodium and risk 
of stroke in the Northern Manhattan study. Stroke 43, 1200-1205 (2012). 

10. Blumenthal, J. A. et al. Lifestyle and neurocognition in older adults with cognitive 
impairments: a randomized trial. Neurology 92, e212-e223 (2019). 

11. Heye, A. K. et al. Blood pressure and sodium: association with MRI markers in cerebral 
small vessel disease. J. Cereb. Blood Flow Metab. 36, 264-274 (2016). 

12. Iadecola, C. The pathobiology of vascular dementia. Neuron 80, 844-866 (2013). 

13. Sweeney, M. D. et al. Vascular dysfunction—the disregarded partner of Alzheimer’s 
disease. Alzheimers Dement. 15, 158-167 (2019). 

14. Shi, Y. et al. Cerebral blood flow in small vessel disease: a systematic review and meta- 
analysis. J. Cereb. Blood Flow Metab. 36, 1653-1667 (2016). 

15. Marshall, R. S. et al. Recovery of brain function during induced cerebral hypoperfusion. 
Brain 124, 1208-1217 (2001). 

16. Wang, Y. & Mandelkow, E. Tau in physiology and pathology. Nat. Rev. Neurosci. 17, 5-21 
(2016). 

17. Zhao, Y. et al. Sodium intake regulates glucose homeostasis through the PPARδ/ 
adiponectin-mediated SGLT2 pathway. Cell Metab. 23, 699-711 (2016). 

18. Min, S. W. et al. Critical role of acetylation in tau-mediated neurodegeneration and 
cognitive deficits. Nat. Med. 21, 1154-1162 (2015). 

19. Iadecola, C. et al. SOD1 rescues cerebral endothelial dysfunction in mice overexpressing 
amyloid precursor protein. Nat. Neurosci. 2, 157-161 (1999). 

20. Faraco, G. et al. Perivascular macrophages mediate the neurovascular and cognitive 
dysfunction associated with hypertension. J. Clin. Invest. 126, 4674-4689 (2016). 

21. Arendt, T., Stieler, J. T. & Holzer, M. Tau and tauopathies. Brain Res. Bull. 126, 238-292 
(2016). 

22. Lee, M. S. et al. Neurotoxicity induces cleavage of p35 to p25 by calpain. Nature 405, 
360-364 (2000). 

23. Patrick, G. N. et al. Conversion of p35 to p25 deregulates Cdk5 activity and promotes 
neurodegeneration. Nature 402, 615-622 (1999). 

24. Austin, S. A. & Katusic, Z. S. Loss of endothelial nitric oxide synthase promotes p25 
generation and tau phosphorylation in a murine model of Alzheimer's disease. Circ. Res. 
119, 1128-1134 (2016). 

25. Bibb, J. A. et al. Phosphorylation of DARPP-32 by Cdk5 modulates dopamine signalling in 
neurons. Nature 402, 669-671 (1999). 

26. Shukla, V. et al. A truncated peptide from p35, a Cdk5 activator, prevents Alzheimer’s 
disease phenotypes in model mice. FASEB J. 27, 174-186 (2013). 

27. Kimura, T. et al. Isomerase Pin1 stimulates dephosphorylation of tau protein at 
cyclin-dependent kinase (Cdk5)-dependent Alzheimer phosphorylation sites. J. Biol. Chem. 
288, 7968-7977 (2013). 

28. Ono, Y., Saido, T. C. & Sorimachi, H. Calpain research for drug discovery: challenges and 
potential. Nat. Rev. Drug Discov. 15, 854-876 (2016). 

29. Etwebi, Z., Landesberg, G., Preston, K., Eguchi, S. & Scalia, R. Mechanistic role of the 
calcium-dependent protease calpain in the endothelial dysfunction induced by MPO 
(myeloperoxidase). Hypertension 71, 761-770 (2018). 

30. Qu, J. et al. S-Nitrosylation activates Cdk5 and contributes to synaptic spine loss induced 
by β-amyloid peptide. Proc. Natl Acad. Sci. USA 108, 14330-14335 (2011). 

31. Iadecola, C. The neurovascular unit coming of age: a journey through neurovascular 
coupling in health and disease. Neuron 96, 17-42 (2017). 

32. Yanamandra, K. et al. Anti-tau antibodies that block tau aggregate seeding in vitro 
markedly decrease pathology and improve cognition in vivo. Neuron 80, 402-414 
(2013). 

33. Powles, J. et al. Global, regional and national sodium intakes in 1990 and 2010: a 
systematic analysis of 24 h urinary sodium excretion and dietary surveys worldwide. BMJ 
Open 3, e003733 (2013). 


Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in 
published maps and institutional affiliations. 


© The Author(s), under exclusive licence to Springer Nature Limited 2019 


Methods 


Most of the methods used in this study are well established in the labo- 
ratory and have been described in detail in previous publications’”°**. 
Here we provide only a brief description. 


Mice 

All procedures were approved by the institutional animal care and use 
committee of Weill Cornell Medicine (animal protocol number: 0807- 
777A). Studies were conducted, according to the ARRIVE guidelines 
(https://www.nc3rs.org.uk/arrive-guidelines), in the following lines 
of mice: C57BL/6 (JAX), B6.129X1-Mapttm1Hnd (Tau, JAX, Stock 
#007251), B6.129P2-Nos3tm1Unc/J (eNOS, JAX, Stock #002684), 
B6.129S4-Nos1tm1Plh/J (nNOS, JAX, Stock #002986), 129S6.Cg-Tg(APPSWE) 
2576Kha N20+? (Taconic, Stock #279) and Tg(Camk2a-tTA)1Mmay 
Fgf14Tg(tetO-MAPT*P301L)4510Kha/J (rTg4510, JAX, Stock #024854). Unless 
otherwise indicated, 8-week-old male mice were used. 


High-salt diet 

Male or female mice (8 weeks old) received normal chow (0.5% NaCl) 
and tap water ad libitum (normal diet (ND)) or sodium-rich chow (4 or 
8% NaCl) and tap water containing 1% NaCl ad libitum (HSD) for 4-36 
weeks as stated’. 
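
As a quick check on the diet composition above, the following sketch computes how many times saltier each HSD chow is than the normal chow. The helper name `fold_over_normal` is illustrative only, and the calculation ignores the 1% NaCl supplied in the drinking water, so it approximates chow composition rather than actual salt intake.

```python
def fold_over_normal(chow_nacl_percent: float, normal_nacl_percent: float = 0.5) -> float:
    """Return the NaCl content of a chow as a multiple of the normal chow."""
    if normal_nacl_percent <= 0:
        raise ValueError("normal chow NaCl content must be positive")
    return chow_nacl_percent / normal_nacl_percent


# The two sodium-rich chows named in the text (4% and 8% NaCl).
for pct in (4.0, 8.0):
    print(f"{pct:.0f}% NaCl chow: {fold_over_normal(pct):.0f}-fold over normal chow")
```

Under these assumptions the two chows come out at 8-fold and 16-fold the normal chow, matching the 8-16-fold range quoted in the main text.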


In vivo treatments 

The nitric oxide precursor L-arginine (10 g/l; Sigma) was administered in 
the drinking water starting after 8 weeks of HSD and continuing until 12 
weeks. ND- and HSD-fed mice were treated (intraperitoneally (i.p.), weekly) 
with 50 mg/kg of anti-tau (HJ8.8) or mouse IgG1 isotype control (Clone 
MOPC-21; Bio X Cell) antibodies for the last 4 weeks of the HSD treatment 
period (12 weeks) before behavioural and cerebrovascular studies. HJ8.8 is 
a high-affinity antibody generated against human tau that can recognize 
both human and mouse tau (Kd (dissociation constant) = 0.926 nM to 
mouse tau). In other experiments, ND- and HSD-fed mice were treated 
(i.p. twice a week) with 40 mg/kg of TFP5 (KEAFWDRCLSVINLMSSKMLQINAYARAARRAARR) 
or scrambled peptide (GGGFWDRCLSGKGKMSSKGGGINAYARAARRAARR) 
(Peptide 2.0)²⁶ for the last 4 weeks of the HSD 
treatment period (12 weeks) before behavioural and molecular studies. 
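
The weight-based doses above translate into per-injection amounts as sketched below. The 25 g body weight is a hypothetical example value, not one reported here, and `dose_mg` is an illustrative helper name.

```python
def dose_mg(body_weight_g: float, dose_mg_per_kg: float) -> float:
    """Amount of agent (mg) for one injection at a weight-based dose."""
    if body_weight_g <= 0:
        raise ValueError("body weight must be positive")
    return dose_mg_per_kg * body_weight_g / 1000.0


EXAMPLE_WEIGHT_G = 25.0  # hypothetical body weight, not taken from the paper

antibody_mg = dose_mg(EXAMPLE_WEIGHT_G, 50.0)  # HJ8.8 or control IgG, once per week
peptide_mg = dose_mg(EXAMPLE_WEIGHT_G, 40.0)   # TFP5 or scrambled peptide, per injection
peptide_total_mg = peptide_mg * 2 * 4          # twice per week over the last 4 weeks

print(f"antibody per injection: {antibody_mg:.2f} mg")
print(f"peptide per injection:  {peptide_mg:.2f} mg")
print(f"peptide over 4 weeks:   {peptide_total_mg:.2f} mg")
```

In practice the body weight would be measured at each injection, so the amounts would change over the treatment period.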


General surgical procedures for CBF studies 

Mice were anaesthetized with isoflurane (induction, 5%; maintenance, 
2%). The trachea was intubated and mice were artificially ventilated with 
a mixture of N₂ and O₂. One of the femoral arteries was cannulated for 
recording mean arterial pressure (MAP) and collecting blood samples 
for blood gas analysis*®. Rectal temperature was maintained at 37 °C. 
End tidal CO₂, monitored by a CO₂ analyser (Capstar-100, CWE Inc.), was 
maintained at 2.6-2.7% to provide a pCO₂ of 30-40 mm Hg and a pH of 
7.3-7.4³⁷. After surgery, isoflurane was discontinued and anaesthesia was 
maintained with urethane (750 mg/kg, i.p.) and chloralose (50 mg/kg, 
i.p.). Throughout the experiment the level of anaesthesia was monitored 
by testing motor responses to a tail pinch. 


Monitoring CBF 

A small craniotomy (2 × 2 mm) was performed to expose the parietal 
cortex, the dura was removed and the site was superfused with Ringer’s 
solution (37 °C; pH 7.3-7.4)”°. CBF was continuously monitored at the site 
of superfusion with a laser-Doppler probe (Perimed) positioned stere- 
otaxically ~0.5 mm above the cortical surface and connected to a data 
acquisition system (PowerLab). CBF values are expressed as percentage 
increases relative to the resting level. 
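
The normalization described above, CBF expressed as a percentage increase over the resting level, can be sketched as follows. The function name and the example flow values are hypothetical; laser-Doppler readings are in arbitrary perfusion units, so only the ratio is meaningful.

```python
def cbf_percent_increase(resting_flow: float, evoked_flow: float) -> float:
    """Percentage increase of an evoked CBF reading over the resting level."""
    if resting_flow <= 0:
        raise ValueError("resting flow must be positive")
    return 100.0 * (evoked_flow - resting_flow) / resting_flow


# Hypothetical readings in arbitrary perfusion units.
resting = 200.0
evoked = 250.0
print(f"CBF increase: {cbf_percent_increase(resting, evoked):.1f}%")
```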


Protocol for CBF experiments 

After MAP and blood gases stabilized, CBF responses were recorded’. 
The whisker-barrel cortex was activated for 60 s by stroking the con- 
tralateral vibrissae, and the evoked changes in CBF were recorded. 


The endothelium-dependent vasodilator acetylcholine (ACh; 100 µM, 
Sigma) was superfused onto the exposed neocortex for 5 min and the 
associated changes in CBF were recorded by laser-Doppler flowmetry. 
CBF and MAP data were collected using Chart 5 Pro (v.5.5.6). 


Measurement of resting CBF by ASL-MRI 

CBF was assessed quantitatively using arterial spin labelling magnetic 
resonance imaging (ASL-MRI), performed on a 7.0-Tesla 70/30 Bruker 
Biospec small-animal MRI system with 450 mT/m gradient amplitude 
and a 4,500 T/m/s slew rate. A volume coil was used for transmission and 
a surface coil for reception. Anatomical localizer images were acquired 
to find the transversal slice approximately corresponding to bregma 
+0.5 mm. This position was used for subsequent ASL-MRI, which was 
based on a flow-sensitive alternating inversion recovery rapid acquisition 
with relaxation enhancement (FAIR-RARE) pulse sequence labelling 
the inflowing blood by global inversion of the equilibrium magnetization. 
One axial slice was acquired with a field of view of 15 × 15 mm, spatial 
resolution of 0.117 × 0.117 × 1 mm, TE (echo time) of 5.368 ms, effective 
TE of 48.32 ms, recovery time of 10 s and a RARE (rapid imaging with 
refocused echoes) factor of 72. Twenty-two turbo inversion recovery 
values ranging from 30 to 2,300 ms were used, and the inversion slab 
thickness was 4 mm. For computation of resting CBF (rCBF), the Bruker 
ASL perfusion processing macro was used. It uses a published model?” 
and includes steps to mask out the background. The masked rCBF images 
were exported to Analyze format on the MRI console. The ASL images 
were analysed by ImageJ and the average CBF value is reported as ml 
per 100 g of tissue per minute’. 
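
The acquisition matrix is not stated above, but it can be estimated from the field of view and in-plane resolution. The sketch below makes that estimate and also computes the voxel volume; the function name is illustrative, and the result is an inference from the quoted numbers rather than a reported parameter.

```python
def matrix_size(fov_mm: float, pixel_mm: float) -> int:
    """Approximate number of pixels along one axis of the image."""
    if pixel_mm <= 0:
        raise ValueError("pixel size must be positive")
    return round(fov_mm / pixel_mm)


FOV_MM = 15.0        # field of view along each in-plane axis
PIXEL_MM = 0.117     # in-plane resolution
SLICE_MM = 1.0       # slice thickness

n = matrix_size(FOV_MM, PIXEL_MM)
voxel_volume_mm3 = PIXEL_MM * PIXEL_MM * SLICE_MM

print(f"implied matrix: {n} x {n}")
print(f"voxel volume:   {voxel_volume_mm3:.6f} mm^3")
```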


Osmotic minipump implantation for delivery of angiotensin-II 

Osmotic minipumps containing vehicle (saline) or angiotensin-II 
(600 ng/kg/min) were implanted subcutaneously under isoflurane anaes- 
thesia. Systolic blood pressure was monitored in awake mice using tail- 
cuff plethysmography”°. Forty-two days later, mice were anaesthetized 
and their brains were collected for assessment of tau phosphorylation. 


Hypothermia 

C57BL/6 mice (12 weeks old) were anaesthetized by injection of 
ketamine/xylazine (100/10 mg/kg). Rectal temperature was continuously 
monitored and kept at 37 °C (normothermia) or 30 °C (hypothermia) 
using a thermostatically controlled heating pad. Mice were killed 30 min 
after anaesthesia and their brains were collected and frozen on dry ice. 
Tissues were kept at -80 °C until processing for immunoblot analysis. 


Immunoblot analysis 

Cortex (~80-90 mg) and hippocampus (~15 mg) isolated from ND- 
and HSD-fed mice were sonicated in 800 and 500 µl of RIPA buffer, 
respectively (50 mM Tris-HCl pH 8.0, 150 mM NaCl, 0.5% deoxycholic 
acid, 0.1% SDS, 1 mM EDTA pH 8.0, 1% IGEPAL CA-630, 1 mM Na₃VO₄, 
20 mM NaF and one tablet per 10 ml of cOmplete, EDTA-free Protease 
Inhibitor Cocktail, Millipore Sigma) and equal volumes were mixed 
with SDS sample buffer, boiled and analysed on 10% or 10-20% Novex 
WedgeWell gels (Thermo Fisher Scientific). Proteins were transferred to 
PVDF membranes (Millipore), blocked at room temperature for 1 h with 
5% milk in TBS, and incubated overnight at 4 °C, with primary antibod- 
ies (see Reporting Summary) in 5% BSA in TBS/0.1% Tween-20 (TBST). 
Membranes were washed in TBST and incubated with goat anti-mouse 
or rabbit secondary antibodies conjugated to horseradish peroxidase 
(Santa Cruz Biotechnology) for 1 h at room temperature, and protein 
bands were visualized with Clarity Western ECL Substrate (Bio-Rad) on a 
Bio-Rad ChemiDoc MP Imaging System. Quantification was performed 
using Image Laboratory v.6.0 (Bio-Rad). 


Preparation of heat-stable RIPA fractions for tau enrichment 
After homogenization in cold RIPA buffer and centrifugation, 150 µl of 
the supernatant containing the proteins was boiled at 100 °C for 10 min. 
Samples were cooled on ice for 20 min and then centrifuged at 20,000g 
at 4 °C for 15 min. The supernatant corresponding to the heat-stable frac- 
tion was then harvested. This method is used to isolate proteins resist- 
ant to heat, including tau and other microtubule-associated proteins. 
Thus, endogenous immunoglobulins are precipitated during the boiling 
process and eliminated from the supernatant. The proteins were then 
mixed with equal volumes of SDS sample buffer, boiled and analysed 
on 10% Novex WedgeWell gels (Thermo Fisher Scientific). Although 
tau protein is partially lost during the boiling process, the heat-stable 
samples are enriched with tau (Extended Data Fig. 5h). Furthermore, 
boiling substantially improves the specificity of certain antibodies, 
such as AT8, RZ3 or MC1*. 


Tau dephosphorylation 

After overnight dialysis to remove phosphatase inhibitors, protein 
samples (40 µl) were incubated with 5 µl of 10× NEBuffer for Protein 
MetalloPhosphatases (PMP), 5 µl of 10 mM MnCl₂ and 1 µl of Lambda 
protein phosphatase (Lambda PP, New England Biolabs) at 30 °C for 3 h. 
Reactions were stopped by addition of SDS sample buffer and boiling 
for 5 min at 100 °C. 
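
The final concentrations in the dephosphorylation reaction follow from the dilution rule C1·V1 = C2·V2. The sketch below applies it to the volumes listed above, reading the MnCl₂ stock as 10 mM and the buffer stock as 10×; the helper name is illustrative.

```python
# Component volumes in microlitres, as listed in the protocol above.
VOLUMES_UL = {
    "protein sample": 40.0,
    "10x NEBuffer PMP": 5.0,
    "10 mM MnCl2": 5.0,
    "Lambda PP": 1.0,
}


def final_concentration(stock_conc: float, added_ul: float, total_ul: float) -> float:
    """Solve C1*V1 = C2*V2 for the final concentration C2."""
    if total_ul <= 0:
        raise ValueError("total volume must be positive")
    return stock_conc * added_ul / total_ul


total_ul = sum(VOLUMES_UL.values())
mncl2_mm = final_concentration(10.0, VOLUMES_UL["10 mM MnCl2"], total_ul)
buffer_x = final_concentration(10.0, VOLUMES_UL["10x NEBuffer PMP"], total_ul)

print(f"total reaction volume: {total_ul:.0f} ul")
print(f"final MnCl2: {mncl2_mm:.2f} mM, final buffer: {buffer_x:.2f}x")
```

With these volumes the reaction totals 51 µl, so both the buffer and MnCl₂ end up slightly below their nominal 1× and 1 mM working concentrations.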


Brain tissue protein extraction 

Extraction was performed as described previously”. The cortex (~80- 
90 mg) and hippocampus (~15 mg) of each brain were homogenized by 
sonication in 800 and 300 µl, respectively, of RAB buffer (100 mM MES, 
1 mM EDTA, 0.5 mM MgSO₄, 750 mM NaCl, 20 mM NaF, 1 mM Na₃VO₄, 
supplemented by EDTA-free Protease Inhibitor Cocktail, Millipore Sigma). In 
brief, the samples were centrifuged at 50,000g for 20 min at 4 °C using 
an Optima MAX-TLA 120.2 Ultracentrifuge (Beckman). The supernatants 
were collected as RAB-soluble fractions and pellets were resuspended 
in identical volumes of RIPA buffer (150 mM NaCl, 50 mM Tris, 0.5% 
deoxycholic acid, 1% Triton X-100, 0.5% SDS, 25 mM EDTA, pH 8.0, 20 mM 
NaF, 1 mM Na₃VO₄ supplemented by EDTA-free Protease Inhibitor Cocktail, 
Millipore Sigma), and centrifuged at 50,000g for 20 min at 4 °C. The 
supernatants were collected as RIPA-soluble fractions. The pellets were 
sonicated in 70% FA (300 µl for cortex and 125 µl for hippocampus), 
and centrifuged at 50,000g for 20 min at 4 °C. The supernatants were 
collected as 70% FA fractions. All fractions were stored at −80 °C until 
analysed. For western blotting, an aliquot of 100 µl of the FA fractions 
was evaporated in a Savant SpeedVac concentrator at 45 °C for 1 h. The 
samples were resuspended in 100 µl SDS sample buffer with the addition 
of 1 µl of 10 N NaOH, sonicated and then boiled for 5 min. 
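
The centrifugation steps above are given as relative centrifugal force (50,000g). Setting a rotor speed requires the standard conversion RCF = 1.118 × 10⁻⁵ × r × rpm², with r in centimetres. The sketch below applies it; the radius is a hypothetical placeholder and should be taken from the rotor documentation, since it is not stated here.

```python
import math

RCF_FACTOR = 1.118e-5  # standard constant for radius in cm and speed in rpm


def rpm_from_rcf(rcf_g: float, radius_cm: float) -> float:
    """Rotor speed (rpm) needed to reach a given RCF at a given radius."""
    if rcf_g < 0 or radius_cm <= 0:
        raise ValueError("RCF must be non-negative and radius positive")
    return math.sqrt(rcf_g / (RCF_FACTOR * radius_cm))


def rcf_from_rpm(rpm: float, radius_cm: float) -> float:
    """RCF (multiples of g) produced by a rotor speed at a given radius."""
    return RCF_FACTOR * radius_cm * rpm ** 2


HYPOTHETICAL_RADIUS_CM = 3.9  # assumed value; check the rotor manual

speed = rpm_from_rcf(50_000, HYPOTHETICAL_RADIUS_CM)
print(f"speed for 50,000g at r = {HYPOTHETICAL_RADIUS_CM} cm: {speed:.0f} rpm")
```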


Measurement of tau, Aβ and IL-17 

Tau, Aβ and IL-17 were measured using an electrochemiluminescence-based 
multi-array method through the Quickplex SQ 120 system (Meso 
Scale Diagnostics LLC). Tau and Aβ peptide levels (Aβ38, Aβ40 and Aβ42) 
were measured in the RAB, RIPA and FA fractions of both cortex and 
hippocampus using the MSD Mouse Total Tau (K151DSD) and MSD V-PLEX 
Aβ Peptide Panel 1 (4G8) (K15199) kits according to the manufacturer’s 
protocol. IL-17A was measured in the serum of ND- and HSD-fed mice 
using the MSD V-PLEX Mouse IL-17A Kit (K152RFD) according to the 
manufacturer’s protocol. 


Immunohistochemistry 

After 12 weeks of ND or HSD, mice were anaesthetized with intraperi- 
toneal pentobarbital (200 mg/kg), and then perfused transcardiacally 
with cold PBS, followed by cold 4% paraformaldehyde (PFA) in PBS. 
The brains were removed and immersed first in 4% PFA overnight and 
then in 70% ethanol for 3 days. Brains were then embedded in paraffin 
and cut into 6-µm sections using a microtome. After rehydration and 
antigen retrieval in preheated citrate buffer (10 pM) for 30 min, brain 
sections were immersed in 3% H₂O₂ and then blocked with 100% Sniper 
(Biocare Medical) for 1 h. After blocking, sections were incubated for 
2.5 days at 4 °C with the AT8 and MC1 antibodies (1:250 and 1:100 in 


1:50 Sniper in PBS, respectively) and thereafter processed for 1 h with 
the biotinylated secondary antibody in 1% normal donkey serum PBS 
(anti-mouse IgG1, Jackson ImmunoResearch). Reactions were visual- 
ized with the ABC-complex (Vectorlabs) and 3,3-diaminobenzidine. 
A Nikon light microscope was used to visualize the signal associated 
with each antibody. 


Immunofluorescence 

After 12 weeks of ND or HSD, mice were anaesthetized with intraperi- 
toneal pentobarbital (200 mg/kg), and then perfused transcardiacally 
with cold PBS, followed by cold 4% paraformaldehyde (PFA) in PBS. The 
brains were removed and immersed first in 4% PFA overnight. Sections 
(thickness, 40 µm) were cut through the whole brain using a vibratome. 
After blocking with 5% normal donkey serum in 0.5% Triton-X/PBS, sec- 
tions were incubated over the weekend at 4 °C with antibodies against 
NEUN (1:200, mouse, Millipore Sigma, MAB377), GFAP (1:200, mouse, 
Millipore Sigma, G3893), IBA1 (1:200, rabbit, Wako, 019-19741), CD13 
(1:200, goat, R&D Systems, AF2335), calpain 2 (1:100, rabbit, Santa Cruz, 
sc-373966) or CDK5 (1:100, rabbit, Santa Cruz, sc-6247) in 0.5% Triton-X 
in PBS and thereafter processed for 2 h with FITC, Cy3 or Cy5 second- 
ary antibodies in 0.5% Triton-X in PBS. An epifluorescence microscope 
(IX83 Inverted Microscope, Olympus) or a confocal microscope (Leica 
TCS SP5) was used to visualize the signal associated with each antibody. 


Fluoro-Jade B staining 

After rehydration, brain sections (6 µm) were immersed in 1% sodium 
hydroxide in 80% alcohol for 5 min, followed by 2 min in 70% alcohol 
and 2 min in distilled water. The slides were transferred to a solution of 
0.06% potassium permanganate for 10 min on a shaker. After rinsing for 
2 min in distilled water, the slides were immersed in the staining solu- 
tion (0.0004% Fluoro-Jade B, Millipore Sigma, in 0.1% acetic acid) for 
20 min in the dark. Finally, the slides were rinsed three times for 1 min in 
distilled water and then placed on a slide warmer, set at approximately 
50 °C, until they were fully dry. The dry slides were cleared by immersion 
in xylene for at least a minute before coverslipping with DPX (Sigma). 


TUNEL staining 

After rehydration and antigen retrieval according to the manufacturer’s 
protocol (In situ Cell Death Detection Kit, Fluorescein, Roche, #11 684 
795 910), brain sections were blocked for 30 min in Tris-HCl, 0.1 M, pH 7.5, 
containing 3% BSA and 20% normal donkey serum and then incubated 
with the TUNEL reaction mixture for 60 min at 37 °C in a humidified 
atmosphere in the dark. After washing, slides were evaluated by an 
epifluorescence microscope (IX83 Inverted Microscope, Olympus). A 
positive control was obtained by pre-incubating brain slices with DNase 
I recombinant (3,000 U/ml in 50 mM Tris-HCl, pH 7.5, 1 mg/ml BSA), for 
10 min at room temperature, to induce DNA strand breaks. 


Klüver-Barrera white-matter staining 

The Klüver-Barrera stain was performed using the Luxol Fast Blue Stain 
Kit (ScyTek Laboratory Inc.). Brains were removed after transcardiac 
perfusion with PBS and 4% PFA and sectioned with a vibratome (thickness, 
40 µm), and the positive (blue-stained) area in the corpus callosum 
was quantified by ImageJ. 


Thioflavin S staining 

After mounting on slides and post-fixation with 4% PFA in PBS for 10 min, 
coronal brain sections (40 µm) were washed and labelled with 0.05% (w/v) 
thioflavine-S in 50% (v/v) ethanol for 10 min as previously described”. An 
epifluorescence microscope (IX83 Inverted Microscope, Olympus) was 
used to visualize the FITC signal associated with thioflavine-S. 


Calpain activity 
Calpain activity was measured using a Calpain Activity Assay Kit from
Abcam. In brief, fresh cortex and hippocampus were homogenized
in the extraction buffer provided with the kit, which specifically extracts
cytosolic proteins without contamination of cell membrane or lysosome
proteases and prevents auto-activation of calpain during the extraction
procedure. The fluorometric assay is based on the detection of
cleavage of the calpain substrate Ac-LLY-AFC. Ac-LLY-AFC emits blue
light (λmax = 400 nm); upon cleavage of the substrate by calpain, free
AFC emits a yellow-green fluorescence (λmax = 505 nm), which can be
quantified using a fluorometer or a fluorescence plate reader. The specificity
of the signal was confirmed using the calpain inhibitor Z-LLY-FMK
(100-200 μM). The activity is expressed as relative fluorescent units
(RFU) per milligram of protein for each sample.
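
The normalization described above can be sketched as follows. This is an illustration only, not the kit's or the authors' analysis code: the function name is hypothetical, and subtracting a blank (substrate-only) reading before normalization is an assumption that the text does not state.

```python
def calpain_activity_rfu_per_mg(sample_rfu, blank_rfu, protein_mg):
    """Return background-subtracted fluorescence per milligram of protein.

    sample_rfu : raw fluorescence of the sample well (emission near 505 nm)
    blank_rfu  : fluorescence of a blank well (assumed step, not in the text)
    protein_mg : amount of protein loaded in the well, in milligrams
    """
    if protein_mg <= 0:
        raise ValueError("protein_mg must be positive")
    return (sample_rfu - blank_rfu) / protein_mg


# Example: 5,200 RFU raw, 200 RFU blank, 0.5 mg protein per well.
activity = calpain_activity_rfu_per_mg(5200.0, 200.0, 0.5)
print(activity)
```

Expressing the signal per milligram of protein lets samples with different protein yields be compared on the same scale.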


p35/p25 and GSK3β immunoprecipitation

Immunoprecipitation was performed with anti-p35p25 (Cell Signaling),
anti-GSK3β (Cell Signaling) or anti-rabbit monoclonal IgG1 isotype control
antibodies (Santa Cruz Biotechnology). Samples were incubated
overnight with the primary antibodies and then with protein-A sepharose
(p35p25) (GE Healthcare Life Sciences) or protein-G Dynabeads
(GSK3β) (Thermo Fisher Scientific) for 2 h at 4 °C. Precipitates were
used for measurement of CDK5 or GSK3β activity. Immunoprecipitation
was confirmed by loading the samples onto 10% Tris-glycine SDS
polyacrylamide gels and western blotting as described above.


Detection of S-nitrosylation with the biotin-switch technique 
S-nitrosylated calpain 2 was detected using the biotin-switch technique,
as previously described. In brief, samples were sonicated in 800 μl of
RIPA buffer containing 0.1 mM neocuproine and, after centrifugation,
protein concentrations were measured. Cysteine thiol groups in 1 mg of
proteins were blocked with 10% S-methylmethane thiosulfonate (MMTS)
(Sigma). After protein precipitation with 100% acetone, sodium ascorbate
was added to the sample to convert each S-nitrosothiol (SNO) to a free
thiol via a transnitrosation reaction to generate O-nitrosoascorbate.
Next, each nascent free thiol (previously an SNO site) was biotinylated
with biotin-HPDP (Pierce). Biotinylated proteins were then pulled down
using avidin beads and analysed on 10% Novex WedgeWell gels (Thermo
Fisher Scientific). Before avidin pull-down, a small fraction of each sample
was collected to determine protein ‘input’. The degree of pull-down
correlates with protein S-nitrosylation of calpain 2 or CDK5, which was
detected with an antibody against the two proteins. Nitrosylation of
calpain 2 or CDK5 is expressed as the ratio between the pull-down signal
and the input corrected for β-actin levels.
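
One way to read the ratio defined above is sketched below. The text does not spell out the arithmetic, so the interpretation used here (input band divided by the β-actin band, then pull-down divided by that corrected input) is an assumption, and the function name is hypothetical.

```python
def nitrosylation_index(pulldown, input_signal, actin):
    """Ratio of pull-down signal to beta-actin-corrected input.

    All three arguments are band intensities from the same sample in
    arbitrary densitometry units. Dividing the input by beta-actin is
    an assumed reading of "input corrected for beta-actin levels".
    """
    if input_signal <= 0 or actin <= 0:
        raise ValueError("input_signal and actin must be positive")
    corrected_input = input_signal / actin
    return pulldown / corrected_input


# Example: pull-down band 300, input band 1,000, beta-actin band 2.
print(nitrosylation_index(300.0, 1000.0, 2.0))
```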


CDK5 and GSK3β activity

CDK5 activity in brain lysates was determined after pull-down with p25/
p35 antibody (Cell Signaling) from 500 μg total protein using a synthetic
histone H1 peptide substrate (PKTPKKAKKL, Enzo Life Sciences). GSK3β
activity was determined after pull-down with GSK3β antibody (Cell
Signaling) from 100 μg total protein using phospho-glycogen synthase
peptide-2a as substrate (Tocris). Phosphorylation reactions were initiated
by mixing bead-coupled CDK5 with 40 μl reaction buffer containing
the following: 50 mM HEPES.KOH (pH 7.4), 5 mM MgCl2, 0.05% BSA,
50 μM substrate, 50 μM cold ATP, 1 mM dithiothreitol, 1x complete
protease inhibitors without EDTA (Roche Applied Biosciences) and
5 Ci/mmole 32P-ATP. Companion reactions for every sample were
executed in the presence of the CDK5 inhibitor ((R)-CR8, Tocris)
(10 μM) or the GSK3β inhibitor (CHIR 99021, Tocris) (10 μM) to correct
for non-specific activity. Reactions were incubated at 30 °C for 30 min,
after which they were terminated by spotting on P81 phosphocellulose
cation-exchange chromatography paper. Filters were washed four times
for 2 min in 0.5% phosphoric acid, and the remaining radioactivity was
quantified in a scintillation counter by the Cherenkov method.
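
The correction for non-specific activity can be illustrated with a short sketch, assuming that the correction is a simple subtraction of the counts from the inhibitor-containing companion reaction. The text says only that companion reactions were used "to correct for non-specific activity"; the function name and the example counts are hypothetical.

```python
def specific_kinase_activity(total_cpm, inhibitor_cpm):
    """Counts attributable to the kinase of interest.

    total_cpm     : counts from the reaction without inhibitor
    inhibitor_cpm : counts from the companion reaction run with the
                    CDK5 or GSK3-beta inhibitor (non-specific signal)
    Subtraction is an assumed form of the correction.
    """
    return total_cpm - inhibitor_cpm


# Example with made-up counts for one sample.
print(specific_kinase_activity(12500.0, 1500.0))
```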


Novel object recognition test 
The NOR test was conducted under dim light in a plastic box. Stimuli
consisted of plastic objects that varied in colour and shape but had
similar sizes. A video camera mounted on the wall directly above
the box was used to record the testing session for off-line analysis. Mice
were acclimated to the testing room and chamber for one day before
testing. Twenty-four hours after habituation, mice were placed in the
same box in the presence of two identical sample objects and were
allowed to explore for 5 min. After an intersession interval of 24 h, mice
were placed in the same box but one of the two objects was replaced
by a novel object. Mice were allowed to explore for 5 min. Exploratory
behaviour was later assessed manually by an experimenter blinded to
the treatment group. Exploration of an object was defined as the mouse
sniffing the object or touching the object while looking at it. Placing the
forepaws on the objects was considered as exploratory behaviour but
climbing on the objects was not. A minimal exploration time for both
objects (total exploration time) during the test phase (~5 s) was used.
The amount of time taken to explore the novel object was expressed
as a percentage of the total exploration time and provides an index of
recognition memory. Any-Maze v.5.3 was used for collection and
analysis of the behavioural data.
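
The recognition index defined above can be written as a short sketch. Treating the ~5 s minimum as an exclusion criterion (returning no score when total exploration is shorter) is an assumption about how the threshold was applied; the function and parameter names are hypothetical.

```python
def novel_object_preference(novel_s, familiar_s, min_total_s=5.0):
    """Percentage of total exploration time spent on the novel object.

    novel_s, familiar_s : exploration times in seconds during the test phase
    min_total_s         : minimal total exploration time (about 5 s in the text)
    Returns None when the animal explored too little to be scored.
    """
    total = novel_s + familiar_s
    if total < min_total_s:
        return None
    return 100.0 * novel_s / total


# 30 s on the novel object, 20 s on the familiar one.
print(novel_object_preference(30.0, 20.0))
# Too little exploration to score.
print(novel_object_preference(1.0, 2.0))
```

On this scale, 50% corresponds to no preference between the two objects.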


The Barnes maze test 

The Barnes maze consisted of a circular open surface (90 cm in diameter)
elevated to 90 cm on four wooden legs. There were 20 circular holes
(5 cm in diameter) equally spaced around the perimeter, and positioned
2.5 cm from the edge of the maze. No wall and no intra-maze
visual cues were placed around the edge. A wooden plastic escape
box (11 × 6 × 5 cm) was positioned beneath one of the holes. Two neon
lamps and a buzzer were used as aversive stimuli. The Any-Maze 
tracking system (Stoelting) was used to record the movement of 
mice on the maze. Extra-maze visual cues consisted of objects within 
the room (table, computer, sink, door and so on) and the experi- 
menter. Mice were tested in groups of seven to ten, and between tri- 
als they were placed into cages, which were placed in a dark room 
adjacent to the test room for the inter-trial interval (20-30 min). 
No habituation trial was performed. The acquisition phase consisted of 
three consecutive training days with three trials per day with the escape 
hole located at the same location across trials and days. On each trial 
a mouse was placed into a start tube located in the centre of the maze, 
the start tube was raised, and the buzzer was turned on until the mouse 
entered the escape hole. After each trial, mice remained in the escape 
box for 60 s before being returned to their cage. Between trials the maze 
floor was cleaned with 10% ethanol in water to minimize olfactory cues. 
For each trial mice were given 3 min to locate the escape hole, after which 
they were guided to the escape hole or placed directly into the escape 
box if they failed to enter the escape hole. Four parameters of learning 
performance were recorded: (1) the latency to locate (primary latency) 
and (2) enter the escape hole (total latency), (3) the number of errors 
made and (4) the distance travelled before locating the escape hole.
When a mouse dipped its head into a hole that did not provide escape, it
was considered an error. On days 4 and 5, the location of the escape hole 
was moved 180° from its previous location (reverse learning) and two 
trials per day were performed. Any-Maze v.5.3 was used for collection 
and analysis of the behavioural data. 
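
Two of the measures above, primary latency and the number of errors, can be scored from a time-ordered list of head dips, as in the following sketch. It is not the Any-Maze procedure: the data layout, the function names, and the choice to assign the 3-min limit as the latency when the hole is never located are all assumptions. The reversal target follows from the 20 equally spaced holes, where a 180° shift is 10 holes.

```python
N_HOLES = 20            # holes around the perimeter, as described above
TRIAL_LIMIT_S = 180.0   # mice were given 3 min per trial


def reversed_target(target):
    """Hole index after moving the escape hole by 180 degrees."""
    return (target + N_HOLES // 2) % N_HOLES


def score_trial(head_dips, target):
    """Score one trial from (time_s, hole_index) head dips in time order.

    Primary latency is the time of the first dip into the target hole.
    Every dip into a non-target hole before that counts as one error.
    If the target is never located, the trial limit is used as the
    latency (an assumption made for this sketch).
    """
    errors = 0
    for time_s, hole in head_dips:
        if hole == target:
            return {"primary_latency_s": time_s, "primary_errors": errors}
        errors += 1
    return {"primary_latency_s": TRIAL_LIMIT_S, "primary_errors": errors}


dips = [(4.0, 3), (9.5, 7), (15.0, 12)]
print(score_trial(dips, target=12))
print(reversed_target(12))
```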


Statistics 

Sample size was determined using power analysis based on work previously
published by our laboratory on the effects of dietary salt on CBF
regulation and cognitive function. On these bases, 10-15 mice per group
were required in studies involving assessment of cognitive function and
cerebrovascular function. Mice were randomized to the different
experimental conditions and treatments using the random number
generator function (RANDBETWEEN) in Microsoft Excel. Analysis of the
data was performed in a blinded fashion using GraphPad Prism (v.7.0).
All data were tested for normal distribution using the Shapiro-Wilk test.
Intergroup differences were analysed using the two-tailed unpaired t-test
for single comparison or using one- or two-way ANOVA (with Tukey’s or
Bonferroni's post-hoc analysis) for multiple comparisons. Non-normally
distributed data were tested using the Mann-Whitney U test for single
comparison or the Kruskal-Wallis test for multiple comparisons. Data
are expressed as mean ± s.e.m. and differences are considered statistically
significant at P < 0.05.
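
The decision rules in this paragraph can be summarized as a small lookup, sketched below. It only names the test that the text prescribes for a given design and does not perform any statistics; the function names and the two-argument design description are hypothetical simplifications.

```python
ALPHA = 0.05  # significance threshold stated in the text


def choose_test(n_groups, normally_distributed, n_factors=1):
    """Name the test described in the Statistics section for a design.

    n_groups             : number of groups being compared (2 or more)
    normally_distributed : outcome of the Shapiro-Wilk check
    n_factors            : 1 or 2 experimental factors (for ANOVA)
    """
    if n_groups < 2:
        raise ValueError("need at least two groups to compare")
    if n_factors not in (1, 2):
        raise ValueError("n_factors must be 1 or 2")
    if not normally_distributed:
        return "Mann-Whitney U test" if n_groups == 2 else "Kruskal-Wallis test"
    if n_groups == 2 and n_factors == 1:
        return "two-tailed unpaired t-test"
    anova = "one-way ANOVA" if n_factors == 1 else "two-way ANOVA"
    return anova + " with Tukey's or Bonferroni's post-hoc test"


def is_significant(p_value):
    """Apply the P < 0.05 criterion."""
    return p_value < ALPHA


print(choose_test(2, True))
print(choose_test(4, False))
print(is_significant(0.0337))
```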


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 
Source data include final quantifications from in vivo animal work. 
Extended Data Fig. 1 | HSD (4 or 8%) induced tau phosphorylation: brain
localization, sex differences and time course. a, HSD (8% NaCl) increases tau
phosphorylation on Ser396 (PHF13) and on Ser199Ser202 in the hippocampus
(HIPP) but not in the neocortex (CTX) (HIPP: PHF13, ND/HSD n = 4/5, *P = 0.0016;
Ser199Ser202, ND/HSD n = 4/5, *P = 0.0337; two-tailed unpaired t-test versus ND),
whereas acetylation of tau on Lys280 (K280) is not affected. MC1
immunoreactivity increases in both neocortex and hippocampus in HSD-fed
mice but reaches statistical significance only in neocortex (MC1, ND/HSD n = 4/5,
*P = 0.0321 versus ND, two-tailed unpaired t-test). b, Tau phosphorylation on
Ser199Ser202 and Ser202Thr205 is abolished after treatment of brain samples
with lambda phosphatase. c, HSD increases AT8 immunoreactivity (left graph)
in neocortex and hippocampus of female mice, but RZ3 (middle) increases only
in neocortex (AT8, cortex: ND/HSD n = 8/10, *P = 0.0159; hippocampus: ND/HSD
n = 10/9, *P = 0.0151; RZ3, cortex: ND/HSD n = 10/8, *P = 0.0117; two-tailed
unpaired t-test for HSD versus ND). Right, HSD induces a deficit in NOR in female
mice (ND/HSD n = 8/9, *P = 0.0017 versus ND, two-tailed unpaired t-test). d, HSD
increases AT8 immunoreactivity in neuronal cell bodies of the somatosensory
cortex (scale bars, 100 μm (main images); 10 μm (insets)) and MC1
immunoreactivity in neuronal bodies of the pyriform cortex (scale bars, 500 μm
(main images); 100 μm (insets)). Representative images from ND- and HSD-fed
mice (n = 5 per group). e, Thioflavin S staining is not present in mice fed an HSD,
indicating absence of neurofibrillary tangles, which can be observed in rTg4510
mice (scale bars, 500 μm (main images); 100 μm (inset)). Representative images
from n = 5 ND- and HSD-fed mice and n = 3 rTg4510 mice. f, HSD (4%) increases
AT8 immunoreactivity in the neocortex but not in the hippocampus (AT8,
cortex: ND/HSD n = 5/5, *P = 0.0148 versus ND, two-tailed unpaired t-test). RZ3
was not increased in either region. g, Time course of the increase in AT8 and RZ3
induced by HSD in the hippocampus. AT8 levels are increased after 4 weeks of
HSD. RZ3 levels are increased after 4, 12, 24 and 36 weeks of HSD (AT8, 4 weeks:
ND/HSD n = 4/5, *P = 0.0386; 12 weeks: ND/HSD n = 9/9, *P < 0.0001; RZ3, 4 weeks:
ND/HSD n = 4/5, *P = 0.0041; RZ3, 12 weeks: ND/HSD n = 9/9, *P = 0.0011; 24
weeks: ND/HSD n = 7/10, *P = 0.0017; 36 weeks: ND/HSD n = 5/4, *P = 0.0188;
two-tailed unpaired t-test for HSD versus ND). For gel source data see Supplementary
Fig. 1. Data are expressed as mean ± s.e.m.
Extended Data Fig. 2 | Effect of HSD on neurons, astrocytes, microglia/
macrophages, pericytes and white-matter integrity. a, HSD (NaCl 8%) does
not affect neurons (NEUN), astrocytes (GFAP), microglia/macrophages (IBA-1)
(cortex: microglia, ND/HSD n = 5/5, P = 0.0570; hippocampus: ND/HSD n = 5/5,
P = 0.0556; two-tailed unpaired t-test for HSD versus ND) or pericytes (CD13) in
both the pyriform cortex and the hippocampus (scale bars, 500 μm (except
where specified)). b, No evidence of neuronal cell death is observed in HSD-fed
mice by Fluoro-Jade B or TUNEL staining (scale bar, 500 μm). +DNase indicates
positive control for TUNEL staining. Representative images from ND- and
HSD-fed mice (n = 5 per group). c, Klüver-Barrera stain shows no white-matter
damage in the corpus callosum of HSD-fed mice (scale bar, 100 μm).
Representative images from ND- and HSD-fed mice (n = 4 per group). Data are
expressed as mean ± s.e.m.


Extended Data Fig. 3 | Aβ levels in HSD-fed mice and correlation of
behavioural deficits with p-tau, as well as p-tau in hypertension, HSD-treated
Tg2576 mice and hypothermia. a, HSD (8% NaCl) does not alter the
distance travelled before finding the escape hole in the Barnes maze (primary
distance, ND/HSD n = 13/13, diet: *P = 0.0462, time: *P < 0.0001, two-way repeated
measures ANOVA and Bonferroni's test; primary distance day 5: ND/HSD
n = 13/13, P = 0.0670 versus ND, two-tailed unpaired t-test) or the number of
errors made (primary errors, ND/HSD n = 13/13, diet: P = 0.110, time: *P = 0.0004,
two-way repeated measures ANOVA and Bonferroni's test; primary errors day 5:
P = 0.1226 versus ND, two-tailed unpaired t-test). b, RZ3 levels in the cortex
correlate with cognitive performance on the NOR test. No correlation was found
between hippocampal RZ3 levels and cognitive performance on either the
Barnes maze or the NOR test (RZ3 cortex: Barnes maze r = 0.2828, *P = 0.0133,
n = 76; NOR r = −0.2806, *P = 0.0170, n = 72; RZ3 hippocampus: Barnes maze
r = 0.1739, P = 0.1470, n = 71; NOR r = −0.1746, P = 0.1577, n = 67, Pearson’s
correlation coefficient). c, HSD does not increase soluble or insoluble Aβ38, Aβ40
or Aβ42 in the neocortex. Aβ38, soluble ND/HSD n = 11/9, insoluble ND/HSD n = 7/6;
Aβ40, soluble ND/HSD n = 11/14, insoluble ND/HSD n = 7/6; Aβ42, soluble ND/HSD
n = 9/9, insoluble ND/HSD n = 7/6. d, Delivery of angiotensin II (ANGII;
600 ng kg−1 min−1, subcutaneously (s.c.)) via osmotic minipumps over 6 weeks
increases systolic blood pressure (SBP) and induces cognitive deficits
(SBP: Veh/ANGII n = 10/10, treatment: *P < 0.0001, time: *P < 0.0001, repeated
measures two-way ANOVA and Bonferroni's test; NOR: 2 weeks Veh/ANGII
n = 12/12, 4 weeks Veh/ANGII n = 10/11, 6 weeks Veh/ANGII n = 7/7, treatment:
*P < 0.0021, time: *P = 0.0208, two-way ANOVA and Bonferroni's test).
e, Administration of angiotensin II increases AT8 and RZ3 immunoreactivity in
the neocortex but not the hippocampus (cortex, AT8 6 weeks: Veh/ANGII n = 4/4,
*P = 0.0324; RZ3 6 weeks: Veh/ANGII n = 5/5, *P = 0.0262; hippocampus, AT8
6 weeks: Veh/ANGII n = 5/5, P = 0.4056; RZ3 6 weeks: Veh/ANGII n = 5/5,
P = 0.0556, two-tailed unpaired t-test versus vehicle). f, HSD increases AT8 and
RZ3 levels in both the neocortex and the hippocampus of 6-month-old Tg2576
mice (cortex, AT8: *P < 0.0001; hippocampus, AT8: *P = 0.0153; cortex, RZ3:
*P < 0.0001; hippocampus, RZ3: *P = 0.0239; two-tailed unpaired t-test for HSD
versus ND). g, Hypothermia induces massive AT8 phosphorylation (cortex: AT8
n = 4/5, *P = 0.0159; hippocampus: AT8 n = 4/5, *P = 0.0159) and increases MC1
(cortex: MC1 n = 4/5, *P = 0.0317; hippocampus: MC1 n = 4/5, *P = 0.0159) and RZ3
(cortex: RZ3 n = 4/5, *P = 0.0201; hippocampus: RZ3 n = 4/5, *P = 0.0453).
Unpaired two-tailed t-test for hypothermia (HYPO) versus normal temperature
(NT). h, Unlike HSD (Fig. 1G), hypothermia does not shift tau from soluble to
more insoluble fractions. For gel source data see Supplementary Fig. 1. Data are
expressed as mean ± s.e.m.
Extended Data Fig. 4 | Effect of L-arginine on p-tau and calpain expression, as
well as p-tau in eNOS−/− mice, calpain and CDK5 localization, pDARPP-32
with HSD, and IL-17 levels. a, Administration of L-arginine (10 g l−1 in drinking
water), starting at week 8 of HSD and continuing through week 12, suppresses
RZ3 levels in the neocortex but not in the hippocampus (cortex: RZ3, ND/HSD
n = 10/10, *P < 0.0001; hippocampus: RZ3, ND/HSD n = 10/10, *P = 0.0005,
two-tailed unpaired t-test versus normal diet with vehicle). b, L-Arginine does not
affect the increase in serum IL-17 induced by HSD (Veh, ND/HSD n = 9/11,
*P = 0.0002 versus ND Veh; L-arg, ND/HSD n = 9/8, *P < 0.0001 versus ND L-arg,
two-tailed unpaired t-test). c, AT8 and RZ3 levels are elevated in the neocortex
and hippocampus of eNOS−/− mice on ND (AT8: cortex, ND/HSD n = 5/4,
*P = 0.0029; hippocampus, ND/HSD n = 5/4, *P = 0.0078; RZ3: cortex, ND/HSD
n = 5/4, *P = 0.0003; hippocampus, ND/HSD n = 5/4, *P = 0.0128, two-tailed
unpaired t-test versus wild-type mice). d, HSD does not increase tau
phosphorylation in eNOS−/− mice (RZ3: hippocampus, ND/HSD n = 7/8,
*P = 0.0224 versus ND, two-tailed unpaired t-test). e, Calpain 2 immunoreactivity
is present in neuronal cell bodies of the somatosensory and piriform cortex
(scale bars, 500 μm (left); 100 μm (right)). Representative images from n = 3
mice. f, Colocalization of calpain 2 and CDK5 in neuronal cell bodies of the
piriform cortex (scale bars, 50 μm (main images); 10 μm (inset)). Representative
images from n = 3 mice. g, HSD has no effect on the phosphorylation of the CDK5
substrate DARPP-32 in neocortex; ND/HSD n = 10/10. h, Administration of the
CDK5 peptide inhibitor TFP5 has no effect on the increase in serum IL-17 levels
induced by HSD (scrambled: ND/HSD n = 5/4, *P = 0.0002 versus ND scrambled;
TFP5: ND/HSD n = 7/8, *P < 0.359 versus ND TFP5; two-tailed unpaired t-test).
i, L-Arginine does not alter the levels of calpain 1 and 2 in the neocortex or
hippocampus. ND/HSD n = 3/5. For gel source data see Supplementary Fig. 1.
Data are expressed as mean ± s.e.m.
Extended Data Fig. 5 | GSK3β, PIN-1, calpastatin and CDK5 nitrosylation in
HSD-fed mice, as well as neurovascular coupling, effect of HJ8.8 on p-tau,
serum IL-17, and summary. a, HSD has no effect on the expression or activity of
GSK3β in the neocortex. ND/HSD n = 10/10. b, HSD does not alter the expression
of the prolyl cis/trans isomerase PIN-1, a regulator of tau dephosphorylation.
ND/HSD n = 5/5. c, The expression of calpastatin, an endogenous inhibitor of
calpain activity, is not reduced by HSD. ND/HSD n = 10/10. d, Nitrosylation of
CDK5 is reduced in the neocortex of HSD-fed mice (ND/HSD n = 9/9, diet:
*P = 0.0143; ascorbate: *P < 0.0001; two-way ANOVA and Tukey’s test). e, HJ8.8
reduces AT8 in the hippocampus (IgG: ND/HSD n = 13/12; HJ8.8: ND/HSD n = 9/13;
*P < 0.0001, Kruskal-Wallis test and Dunn’s test). RZ3 levels are not altered by
HJ8.8. f, Administration of HJ8.8 does not alter the increase in serum IL-17 levels
induced by HSD (IgG: ND/HSD n = 9/9, *P = 0.0192 versus ND IgG; HJ8.8: ND/HSD
n = 7/5, *P = 0.0421 versus ND HJ8.8, two-tailed unpaired t-test). g, The increase in
somatosensory cortex CBF induced by neural activity evoked by mechanical
stimulation of the whiskers is not reduced by HSD in wild-type, tau−/− or
HJ8.8-treated mice (wild-type ND/HSD n = 5/7, tau−/− ND/HSD n = 9/8; IgG ND/HSD
n = 5/5, HJ8.8 ND/HSD n = 5/5). h, Western blotting showing enrichment of tau in
boiled RIPA neocortical samples (heat-stable fraction, HS). Note that β-actin is
lost during the boiling process. Representative images from n = 3 experiments.
i, Cartoon depicting the mechanisms by which HSD leads to tau phosphorylation
and cognitive impairment. HSD elicits a TH17 response in the small intestine,
which leads to an increase in circulating IL-17. IL-17, in turn, suppresses
endothelial NO production by inducing inhibitory phosphorylation of eNOS at
Thr495. The NO deficit results in reduced nitrosylation of calpain in neurons,
and increases in calpain activity, p35 to p25 cleavage, activation of CDK5, and tau
phosphorylation, which is ultimately responsible for cognitive dysfunction.
In support of this chain of events, rescuing the endothelial NO deficit with
L-arginine, lack of tau in tau-null mice, treatment with the CDK5 peptide inhibitor
TFP5 or treatment with antibodies directed against tau (Tau ab) prevent the
cognitive dysfunction. For gel source data see Supplementary Fig. 1. Data are
expressed as mean ± s.e.m.
was used for acquiring phospho-Tau immunostaining images. Microsoft Excel (for Office365) was used for mouse randomization.


Data analysis Chart 5 Pro (v5.5.6) was used for analysis of CBF data. Any Maze (v5.3) was used for analysis of behavioral data. Biorad Image Lab (v6.0)
statistical analysis.
Life sciences study design


All studies must disclose on these points even when the disclosure is negative. 


Sample size: Sample size was determined by power analysis based on previously published work from our lab on CBF regulation and
behavior.


Data exclusions: No data were excluded.


Replication: At least three independent experiments (with 3 to 5 subjects each) were performed for all the findings in the
manuscript. All attempts at replication were successful and are included in the figures.


Randomization: Mouse randomization was based on the random number generator function (RANDBETWEEN) in Microsoft Excel.


Blinding: Analysis was performed in a blinded fashion.


Reporting for specific materials, systems and methods 


Materials & experimental systems (n/a | Involved in the study): Unique biological materials; Antibodies; Eukaryotic cell lines; Palaeontology; Animals and other organisms; Human research participants

Methods (n/a | Involved in the study): ChIP-seq; Flow cytometry; MRI-based neuroimaging


Antibodies 


Antibodies used: pTau Ser199Ser202, Rabbit polyclonal (ThermoFisher #44-768G); pTau Ser202Thr205 (AT8), Mouse monoclonal (ThermoFisher
#MN1020); pTau Ser396 (PHF13), Mouse monoclonal (Cell Signaling #9632); pThr231 (RZ3), Mouse monoclonal (Gift from Prof.
Peter Davies); MC-1, Mouse monoclonal (Gift from Prof. Peter Davies); K280, Rabbit polyclonal (Anaspec #AS-56077); pSer396
(PHF1), Mouse monoclonal (Gift from Prof. Peter Davies); Tau46, Mouse monoclonal (Cell Signaling #4019); p35/p25, Rabbit
monoclonal, Clone C64B10 (Cell Signaling #2680); Cdk5, Rabbit polyclonal (Cell Signaling #2506); GSK3β, Rabbit polyclonal (Cell
Signaling #9315); Calpain 1, Rabbit polyclonal (Cell Signaling #2556); Calpain 2, Rabbit polyclonal (Cell Signaling #2539); Calpain
2 (IHC), Mouse monoclonal, E10 (Santa Cruz #373966); Calpastatin, Rabbit polyclonal (Cell Signaling #4146); pThr75 DARPP-32,
Rabbit polyclonal (Cell Signaling #2301); DARPP-32, Rabbit polyclonal (Cell Signaling #2302); Pin1, Rabbit polyclonal (Cell
Signaling #3722); β-Actin, Mouse monoclonal (Sigma #A5441); NeuN, Mouse monoclonal (Millipore #MAB377); Iba1, Rabbit
polyclonal (Wako #WEG2172); GFAP, Mouse monoclonal, clone G-A-5 (Sigma #63893); N/CD13, Goat polyclonal (R&D Systems
#AF2335).


Validation: AT8 and pSer199Ser202 Tau antibodies were validated by treating the brain samples with Lambda phosphatase. All phospho-Tau
antibodies and the K280 antibody were also validated in mice with deletion of Tau. Cdk5 and p35p25 antibodies have been
validated in immunoprecipitation experiments with IgG control antibody. All the other antibodies were extensively validated by
us and/or in the literature.


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals: Studies were conducted, according to the ARRIVE guidelines (https://www.nc3rs.org.uk/arrive-guidelines), in the following lines
of male mice (age: 8 weeks): C57BL/6 (JAX), B6.129X1-Mapttm1Hnd (Tau -/-, JAX, Stock #007251), Tg(Camk2a-tTA)1Mmay
Fgf14Tg(tetO-MAPT*P301L)4510Kha/J (rTg4510, JAX, Stock #024854), B6.129P2-Nos3tm1Unc/J (eNOS-/-, JAX, Stock #002684),
B6.129S4-Nos1tm1Plh/J (nNOS-/-, JAX, Stock #002986), and 129S6.Cg-Tg(APPSWE)2576Kha N20+? (Tg2576 or APPSWE, Taconic,
Stock #279).


Wild animals: The study did not involve wild animals.


Field-collected samples: The study did not involve field-collected samples.
Acquisition
Imaging type(s)
Normalization template


Noise and artifact removal 


Volume censoring 


Statistical modeling & inference 


Model type and settings 


Effect(s) tested 


Specify type of analysis: [| Whole brain 




Arterial Spin Labeling MRI 


7.0 Tesla 70/30 Bruker Biospec small-animal MRI system with 450 mT/m gradient amplitude and a 4,500 T m−1 s−1
slew rate.


ASL-MRI was based on a flow-sensitive alternating inversion recovery rapid acquisition with relaxation enhancement 
(FAIR-RARE) pulse sequence labeling the inflowing blood by global inversion of the equilibrium magnetization. 


The resting cerebral blood flow was measured in the cortex and in the hippocampus. 


Not used 




We tested the effect of high salt diet on the resting CBF. Specifically, we tested whether anti-tau antibodies restored the 
resting CBF reduction mediated by high salt diet. 
The cyclic GMP-AMP synthase (cGAS)-STING pathway is a central component of the
cell-autonomous innate immune system in animals. The cGAS protein is a sensor of
cytosolic viral DNA and, upon sensing DNA, it produces a cyclic GMP-AMP (cGAMP)
signalling molecule that binds to the STING protein and activates the immune
response. The production of cGAMP has also been detected in bacteria, and has
been shown, in Vibrio cholerae, to activate a phospholipase that degrades the inner
bacterial membrane. However, the biological role of cGAMP signalling in bacteria
remains unknown. Here we show that cGAMP signalling is part of an antiphage defence
system that is common in bacteria. This system is composed of a four-gene operon that
encodes the bacterial cGAS and the associated phospholipase, as well as two enzymes
with the eukaryotic-like domains E1, E2 and JAB. We show that this operon confers
resistance against a wide variety of phages. Phage infection triggers the production of
cGAMP, which, in turn, activates the phospholipase, leading to a loss of membrane
integrity and to cell death before completion of phage reproduction. Diverged versions
of this system appear in more than 10% of prokaryotic genomes, and we show that
variants with effectors other than phospholipase also protect against phage infection.
Our results suggest that the eukaryotic cGAS-STING antiviral pathway has ancient
evolutionary roots that stem from microbial defences against phages.


Bacterial antiphage immune systems, such as CRISPR-Cas and restriction
modification systems, tend to concentrate in ‘defence islands’ in
bacterial genomes; this property has facilitated the discovery of defence
systems on the basis of their colocalization with known ones. We
noticed that homologues of the gene that encodes cGAS in V. cholerae
(dncV; ‘dinucleotide cyclase in Vibrio’) frequently tend to appear near
defence genes. Out of 637 homologues of this protein that we identified
through a homology search in 38,167 microbial genomes, we found
that 417 (65.5%) are located in the vicinity of known defence systems
(Fig. 1a). It has previously been shown that such a high propensity of
colocalization with defence genes is a strong predictor that the gene
under inspection has a role in phage resistance. These results therefore
suggest that the gene that encodes the bacterial cGAS participates in
defence against phages.
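
The colocalization measure can be illustrated with a minimal sketch. It assumes genes are given as ordinal positions on a single contig and that 'in the vicinity' means within a fixed number of genes; the window of 10 genes, the function name and the toy positions are hypothetical and are not the criteria used in the study. The last line reproduces the reported proportion from the counts given above.

```python
def fraction_near_defence(homologue_positions, defence_positions, window=10):
    """Fraction of homologues with a known defence gene within `window` genes.

    Positions are ordinal gene indices on one contig (a simplification).
    The window size is an illustrative assumption.
    """
    if not homologue_positions:
        raise ValueError("no homologues given")
    near = sum(
        any(abs(h - d) <= window for d in defence_positions)
        for h in homologue_positions
    )
    return near / len(homologue_positions)


# Toy example: two of three homologues sit near a defence gene.
print(fraction_near_defence([5, 50, 200], [10, 45]))

# Counts reported in the text: 417 of 637 homologues.
print(round(100 * 417 / 637, 1))
```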

The bacterial cGAS and the effector phospholipase ‘cGAMP-activated
phospholipase in Vibrio’ (CapV) are encoded by adjacent genes in the
V. cholerae genome and are probably expressed in a single operon.
The same operon also contains two additional genes, the presence of
which next to the capV-dncV gene pair is conserved in the majority of
cases (96%, 613 out of 637 homologues), suggesting that the putative
functional defence system that involves bacterial cGAS comprises these
four genes (Fig. 1a). As has previously been noted, the two additional
genes encode proteins with domains that are known to be associated
with the eukaryotic ubiquitin system: the E1 and E2 domains that are
typical of ubiquitin transfer and ligation enzymes, and a JAB domain,
which is similar to de-ubiquitinase enzymes that remove ubiquitin from
target proteins (Fig. 1b).

To test whether the four-gene operon that contains cGAS is an
antiphage defence system, we cloned the operon of V. cholerae serovar
O1 biovar El Tor (hereafter, V. cholerae El Tor) or the homologous
operon from Escherichia coli strain TW11681 into the laboratory strain
E. coli MG1655, which naturally lacks this system. In both cases, the
putative four-gene system was cloned together with its upstream and
downstream intergenic regions to preserve promoters, terminators
and other regulatory sequences. We then challenged E. coli MG1655
bacteria containing these four-gene systems with an array of phages
that spans the three major families of tailed, double-stranded DNA
phages (T2, T4, T6 and P1 from the Myoviridae; λ-vir, T5, SECphi18
and SECphi27 from the Siphoviridae; and T7 from the Podoviridae),
as well as the single-stranded DNA phage SECphi17, of the Microviridae
family.

Both the V.-cholerae-derived and the E.-coli-derived four-gene operons
conferred defence against multiple phages. The system from E. coli
provided 10-1,800-fold protection against 6 of the 10 phages that we
tested, and the system from V. cholerae protected against 2 of the phages
(Fig. 1c, Extended Data Fig. 1). None of the systems protected against
the transformation of a multi-copy plasmid (Extended Data Fig. 2). The
stronger protective effect that was observed for the E.-coli-derived
system, as compared to that from V. cholerae, possibly stems from the
higher compatibility of the former with the E. coli host and the coliphages
Fig. 1 | Systems containing bacterial cGAS protect against phage infection. 
a, Genes that encode bacterial cGAS are part of a conserved four-gene operon 
that is genomically associated with antiphage defence systems. Representative 
instances of the four-gene putative defence systems that contain cGAS, and their 
genomic environments, are presented. Genes that are known to be involved in 
defence are shown in yellow. Genes of mobile genetic elements are in dark grey. 
RM, restriction modification; TA, toxin-antitoxin. BREX, Hachiman and Wadjet 
are recently described defence systems. b, Domain organization of the genes 
in the operon that contains cGAS. c, The four-gene operon from V. cholerae El Tor 
or E. coli TW11681 was cloned into E. coli MG1655 (Methods). The efficiency of 
plating is shown for ten phages infecting the control E. coli MG1655 strain (no 
system), the strain with the four-gene operon cloned from E. coli TW11681 


that we used. We therefore proceeded with the E.-coli-derived system 
for further experiments. 

To determine whether the ability to produce and sense cGAMP affects 
defence, we experimented with mutated forms of the system. Deletion 
of either the cGAS-encoding or the phospholipase-encoding gene 
resulted in a complete loss of protection against phage infection (Fig. 1d, 
e, Extended Data Fig. 3a–d). Moreover, point mutations in two essential 
aspartate residues in the cGAMP-producing active site of cGAS 
(D129A/D131A) abolished defence, which suggests that cGAMP production is 
absolutely necessary for the phage-defensive properties of the system 
(Fig. 1d, e, Extended Data Fig. 3a–d). An S60A point mutation that 
inactivates the catalytic site of the CapV phospholipase also rendered the 
system completely inactive against all the phages that we tested, 
demonstrating that the cGAMP-controlled phospholipase activity of CapV is 
essential for defence (Fig. 1d, e, Extended Data Fig. 3a–d). These results 
indicate that, as in eukaryotes, the cGAS–cGAMP signalling pathway in 
bacteria participates in antiviral defence. 

We next examined mutations in the two additional genes in the four-gene 
operon. Defence against the myophage P1 was not affected when 
the gene encoding the E1 and E2 domains or the gene encoding the 
JAB domain was deleted, which suggests that these two genes are not 
essential for protection against P1 (Fig. 1d). Accordingly, a construct 
that contained only the capV-dncV gene pair (that is, expressed only 
the phospholipase-cGAS pair) showed complete defence against P1 
(Extended Data Fig. 4), demonstrating that the phospholipase-cGAS 
pair can work as a standalone defence system against some phages. 
However, deletion of the gene that encodes the E1 and E2 domains 
abolished defence against all of the other phages that were tested (T2, 
T4, T5, T6 and λ-vir). The presence of an intact gene that encodes the 
JAB-domain protein was necessary for protection against some phages 
V. cholerae El Tor (system from V. cholerae). Data represent plaque-forming units 
(PFU) per millilitre; bar graph represents average of three independent 
replicates, with individual data points overlaid. d, e, Efficiency of plating of 
phages infecting strains with the wild-type (WT) four-gene operon cloned from 
E. coli TW11681, deletion strains in that operon and strains with point mutations. 
Data represent plaque-forming units per millilitre; bar graph represents average 
of three independent replicates, with individual data points overlaid. Empty 
vector represents a control E. coli MG1655 strain that lacks the system and 
contains an empty vector instead. d, Infection with phage P1. e, Infection with 
phage T2. 


(T4, T5 and T6), but not against others (P1, T2 and λ-vir). Point mutations 
that are predicted to inactivate the active site of the E1 domain 
(C493A/C496A) or to inactivate the active site of the peptidase in the 
JAB domain (D38A) recapitulated the results of deletions of the entire 
genes, suggesting that the predicted enzymatic activities of the E1-E2 
and JAB proteins are necessary for their roles in phage defence (Fig. 1d, 
e, Extended Data Fig. 3a–d). Our mutational analyses suggest that the 
four-gene operon forms a bacterial system that relies on cGAMP signalling 
and phospholipase activity to defend against a broad range of 
phages, and that the activities of the two additional genes are necessary 
for defence against some, but not all, phages. We denote this system 
the cyclic-oligonucleotide-based antiphage signalling system (CBASS). 

In animals, the sensing of viral infection is the trigger that activates the 
production of cGAMP. We therefore sought to examine whether phage 
infection triggers the production of cGAMP in the bacterial system as 
well. To this end, we took advantage of the fact that the CapV phospho- 
lipase is activated by cGAMP, and used this protein as a reporter for the 
presence of cGAMP in infected cells. We first expressed and purified 
the CapV phospholipase from the E. coli CBASS, and measured 
its activity using an in vitro phospholipase activity assay. As expected, 
chemically synthesized 3′3′-cGAMP induced CapV phospholipase activity 
in a concentration-dependent manner (Fig. 2a). Next, we exposed 
the purified CapV to cell lysates that were derived from cells containing 
the ΔcapV CBASS, and which had been filtered to include small molecules 
only (Methods). The CapV phospholipase became markedly 
activated when exposed to cell lysates that were collected 40 min after 
infection with phage P1, which suggests that phage infection triggers 
cGAMP accumulation in cells that contain the CBASS (Fig. 2b). These 
results were corroborated by targeted mass spectrometry analysis, 
which detected cGAMP concentrations of 1.4–1.9 μM in lysates that were 
Fig. 2 | Phage infection triggers cGAMP accumulation and cell death. 

a, Purified CapV protein was incubated in vitro with synthetically produced 
3′3′-cGAMP in the presence of resorufin butyrate, a phospholipase substrate 
that emits fluorescence when hydrolysed. The x axis shows the concentration of 
cGAMP added (μM); the y axis shows the enzyme activity rate, measured by the 
accumulation rate of fluorescence units (FU) per second (Methods). Bar graph 
represents average of three technical replicates, with individual data points 
overlaid (except for the 2.5 μM concentration, for which the data represent an 
average of two replicates). b, Purified CapV protein was incubated in vitro with 
filtered cell lysates derived from bacterial cultures infected by phage P1 at 
a multiplicity of infection (MOI) of 2. Lysates were extracted from cells 
containing the CBASS with the capV gene deleted (‘with system’) or from control 
cells that lack the CBASS and contain an empty vector instead (‘no system’). 
Lysates were collected 30 min or 40 min after initial infection. Bar graph 


collected 40 min after infection (Fig. 2c). CapV was not activated when 
exposed to lysates that were collected from uninfected cells, and only 
slightly activated by lysates that were collected from cells 30 min after 
infection. This implies that the phage component or cell state sensed 
by the CBASS appears in the cell relatively late in the phage-infection 
cycle (Fig. 2b). 

It has previously been shown that, when stimulated by cGAMP, the 
CapV phospholipase degrades the membrane of V. cholerae, which leads 
to a loss of membrane integrity and to the arrest of cell growth or to 
death. This, combined with our finding that the production of cGAMP 
is triggered by phage infection, led us to hypothesize that this defence 
system executes its defence via abortive infection. Abortive-infection 
defence systems exert their activity by causing the infected bacterial cell 
to ‘commit suicide’ before the phage replication cycle can be completed. 
This strategy eliminates infected cells from the bacterial population 
and protects the culture from a viral epidemic. A phenotype such as 
this predicts that, with a high multiplicity of infection (MOI) (in which 
nearly all bacteria are infected in the first cycle), massive cell death will 
be observed in the culture, even for cells that contain the defence system. 
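
The link between MOI and the fraction of cells infected in the first cycle can be sketched with a simple Poisson model. This is an illustrative assumption (phages adsorb to cells randomly and independently), not a calculation reported here, and the function name is hypothetical.

```python
import math


def fraction_infected(moi: float) -> float:
    """Expected fraction of cells that adsorb at least one phage.

    Assumes the number of phages per cell is Poisson-distributed with
    mean equal to the MOI (an idealization, not taken from the paper).
    """
    if moi < 0:
        raise ValueError("MOI must be non-negative")
    # P(at least one phage) = 1 - P(zero phages) = 1 - exp(-MOI)
    return 1.0 - math.exp(-moi)


for moi in (0.2, 2.0):
    print(f"MOI {moi}: {fraction_infected(moi):.0%} of cells infected")
```

Under this assumption, an MOI of 0.2 corresponds to roughly 18% of cells being infected initially, and an MOI of 2 to roughly 86%.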

To test this hypothesis, we infected bacteria growing in liquid cultures 
with phage P1 at varying MOIs and examined the culture dynamics. At an 
MOI of 0.2 (in which only around 20% of bacteria are initially infected by 
the phage) the culture of the wild-type cells collapsed owing to phage 
propagation, whereas the culture of the cells containing the CBASS was 
viable (Fig. 2d). Conversely, at an MOI of 2 (in which almost all bacteria 
are infected by the phage), the culture of wild-type cells and the culture 
of cells containing the CBASS both collapsed, which indicates cell lysis of 
the infected, CBASS-containing cells. Moreover, whereas at an MOI 
of 2 the wild-type culture started collapsing 70 min after initial infection 
(consistent with a time to lysis of 60–70 min after infection, as has 
represents average of three technical replicates, with individual data points 
overlaid. c, Targeted mass spectrometry analysis was performed on the filtered 
cell lysates collected 40 min after initial infection by phage P1 at an MOI of 2. 
Lysates from uninfected samples were taken as control. The y axis represents the 
concentration of 3′3′-cGAMP in the cell lysate measured by mass spectrometry, 
as calculated on the basis of a calibration curve with synthetically produced 
3′3′-cGAMP (Methods). Three independent replicates for each condition are 
shown, and each bar represents an individual replicate. d, Growth curves in 
liquid culture for bacteria that contain the CBASS and bacteria that lack the 
CBASS (empty vector), infected by phage P1 at 37 °C. Bacteria were infected at 
time 0 at an MOI of 0.2 or 2. OD600, optical density at a wavelength of 600 nm. 
Three independent replicates are shown for each MOI, and each curve shows an 
individual replicate. e, Model for CBASS-based antiphage activity in bacteria. 


previously been reported for phage P1), the culture that contained the 
CBASS started collapsing as early as 45 min after infection, a period 
that is insufficient for the completion of the phage P1 replication cycle 
(Fig. 2d). These results were reproduced for other phages; when phages 
were applied at a high MOI, the cells that contained the CBASS showed 
lysis before the necessary time had elapsed for the completion of the 
phage replication cycle (Extended Data Fig. 5a). In the case of phage 
P1, the same phenotype was maintained for constructs that included 
only the capV-dncV gene pair (that is, without the genes encoding the E1-E2 
domains and JAB domain proteins), indicating that the latter two genes 
are not necessary for defence by abortive infection against phage P1 
(Extended Data Fig. 5b). 

These results were further corroborated by the staining of infected 
cells with propidium iodide, a fluorescent DNA-binding agent that is not 
membrane-permeable and cannot normally enter cells. Fluorescence-activated 
cell sorting (FACS) analysis showed that at 40 min after infection 
by phage P1, a substantial population of the cells that contain the 
CBASS became stained by propidium iodide, which indicates a loss of 
membrane integrity (Extended Data Fig. 6). This was associated with 
the loss of normal cell shape, as was observed via microscopy (Extended 
Data Fig. 7). 

The above results suggest a model in which the cGAS-phospholipase 
defence system somehow senses a phage infection, and this sensing 
then triggers the production of cGAMP by the cGAS protein. The cGAMP 
molecule in turn activates the phospholipase, which degrades the bacterial 
membrane and causes cell death (Fig. 2e). This cell death before 
the completion of the phage replication cycle aborts the infection and 
prevents further propagation of the phage. 

It has recently been shown that the bacterial cGAS gene (dncV) belongs 
to a large family of oligonucleotide cyclases, members of which are 
a, Phylogenetic tree of proteins with predicted oligonucleotide cyclase domains 
that were identified by searching a set of 38,167 microbial genomes (Methods). 
Clades are coloured following a previous colour coding. The percentage of 
genes that are located near known defence systems is indicated for each clade. 
found in about 10% of all sequenced bacterial genomes. Other genes in 
this family synthesize a diverse set of cyclic oligonucleotide molecules, 
including cyclic UMP-AMP, cyclic di-UMP and even the cyclic trinucleotide 
AMP-AMP-GMP. This family of genes has been divided into several 
clades on the basis of sequence similarity between its members. We 
found that all major clades of this gene family have a high propensity to 
be genomically associated with other known defence genes in defence 
islands (Fig. 3a). Between 44% and 74% of the genes in each clade are 
located near known defence systems, which suggests that this entire 
family of oligonucleotide cyclases participates in phage defence (Fig. 3a). 

When examining the genomic environment of the 6,232 genes that 
we identified as belonging to this family (Supplementary Table 1, Meth- 
ods), we found that in 26% of the cases (1,612 instances) they appeared 
as part of a four-gene operon that resembled the CBASS of V. cholerae 
(including the genes encoding the E1-E2 domains and the JAB-domain 
proteins). In 683 (42%) of these operons the predicted effector gene had 
a phospholipase domain, but in the remaining cases the phospholipase 
domain in the effector gene was replaced by another domain (Fig. 3b). 
Common alternative effector domains included endonucleases (usually 
of the HNH type); a domain comprising predicted transmembrane 
helices; a domain of unknown function (DUF4297) that has previously 
been shown to participate in the Lamassu phage-resistance system; and 
a Toll–interleukin receptor (TIR) domain that was also previously shown 
to participate in antiphage defence (Fig. 3b). These observations imply 
configurations of four-gene and two-gene CBASSs. The identified number of 
instances of each such system, and the percentage occurrence next to known 
that in these variants of the CBASS, the alternative domains replace the 
phospholipase in exerting the cell-suicide effector activity. For example, 
the endonuclease may degrade cellular DNA and the transmembrane- 
domain effector may oligomerize to form membrane pores, as in the 
case of the RexA-RexB abortive infection system. 
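
The proportions quoted for the operon census can be rechecked from the stated counts. The snippet below is only an arithmetic sanity check; the helper name and labels are hypothetical.

```python
def percent(part: int, whole: int) -> int:
    """Return part/whole as a percentage rounded to the nearest integer."""
    return round(100 * part / whole)


# Counts as stated in the text.
counts = {
    "four-gene operons among cyclase genes": (1612, 6232),
    "phospholipase effectors among four-gene operons": (683, 1612),
    "capV-dncV pairs with both additional genes": (613, 637),
}

for label, (part, whole) in counts.items():
    print(f"{label}: {percent(part, whole)}%")
```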

In addition, and consistent with previous reports, we found 
2,745 cases in which the genes that belong to the family of oligonucleotide 
cyclases appeared in the context of a two-gene operon that lacked 
the genes encoding the E1-E2 domains and JAB-domain proteins. The 
second gene in the operon also frequently included phospholipase, 
endonuclease or transmembrane helix domains, which suggests that 
these operons represent a minimal CBASS that comprises only two genes 
(Fig. 3b). The most common predicted effector domain in these operons 
was a domain that included either two or four transmembrane helices; 
effector domains of this type are present in over 2,000 instances of the 
two-gene operons that we identified. 

To test whether predicted two-gene CBASSs have a role in phage resist- 
ance, we examined a two-gene system from Bacillus cereus VD146 that 
contains a predicted effector gene with four transmembrane helices 
(Extended Data Fig. 8a). We engineered this system into the laboratory 
strain Bacillus subtilis BEST7003 and challenged the engineered strain 
with an array of 11 Bacillus phages, as previously described (Methods). 
The system conferred strong defence against one of these phages (the 
myophage SBSphiC), verifying its ability to defend against phages 
(Extended Data Fig. 8b). Deletion of the cGAS-like gene or of the effector 
gene rendered the system inactive (Extended Data Fig. 8b). Finally, 
infection assays of bacteria that contained the two-gene system in liquid 
culture showed culture collapse at a high MOI, consistent with an abortive 
infection system (Extended Data Fig. 8c). 

We observed 32 instances of a two-gene CBASS in which the effector 
gene contained an N-terminal TIR domain, and also a C-terminal STING 
domain (Fig. 3b, Extended Data Fig. 9). Such a domain arrangement 
of STING fused to a TIR domain is also found in primitive eukaryotes, 
including the oyster Crassostrea gigas and the annelid worm Capitella 
teleta. It is therefore possible that the CBASS with the TIR-STING effector 
gene represents the ancient evolutionary origin of the eukaryotic 
cGAS-STING system. 

This study reveals the biological role of a large family of defence 
systems that is widespread in microbial genomes, but much remains 
unknown. The exact phage component sensed by the system is yet to 
be identified; it is unlikely that this component is cytoplasmic double- 
stranded DNA (as is the case in the animal cGAS-STING system), because 
bacteria do not have a nucleus and their cytoplasm therefore always 
contains double-stranded DNA. The role of the genes encoding the E1-E2 
domains and JAB-domain proteins also remains unknown. These genes 
could be involved in the sensing of some phages, or in the mitigation of 
anti-cGAS activities that phages are likely to encode. 

Accumulating evidence suggests that important components of the 
eukaryotic innate immune system have counterparts in bacterial immune 
systems. Argonaute, a central protein in the antiviral RNA interference 
machinery of plants, insects and animals, has also been reported to 
have immunity roles in bacteria and archaea. TIR domains, which are 
essential components of the pathogen-recognizing Toll-like receptors, 
are abundant in bacteria and have recently been shown to have a primary 
role in antiphage defence. Foreign RNA is sensed in eukaryotic cells by 
the oligo-adenylate synthase (OAS) protein, leading to the activation 
of a non-specific RNase; this process has recently been shown to have 
parallels in type III CRISPR-Cas immunity upon the sensing of phage 
RNA. All of these processes, in addition to our finding that cGAS signalling 
in prokaryotes has an antiviral role similar to that in eukaryotes, 
are unlikely to have been the result of parallel evolution. Instead, these 
observations cumulatively point to a scenario in which these defence 
systems first evolved in prokaryotes as means of defence against phages, 
and the ancient eukaryote (which was probably formed by fusion of a 
bacterium and an archaeon) inherited a primordial version of these 
systems from the prokaryotes that formed it. Under this hypothesis, 
these systems became the basis for the primitive immune system of the 
ancient eukaryote, and have evolved into the cell-autonomous immune 
system that we know today. If this hypothesis is correct, future studies 
may find homologues of additional components of the human immune 
system functioning as phage resistance systems in bacteria. 
Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Genomic identification and analysis of DncV homologues 

The protein sequence of DncV (NCBI accession NP_229836) was searched 
against the protein sequences of all genes in 38,167 bacterial and archaeal 
genomes downloaded from the Integrated Microbial Genomes (IMG) 
database in October 2017, using the ‘search’ option in the MMseqs2 
package (release 6-f5a1c) with default parameters. Hits with an e value 
less than 1 x 10° were taken as homologues. The fraction of homologues 
found in the vicinity of known defence systems was calculated as previously 
described, using a positive set of defence gene families that was 
updated to include a recently discovered set of defence genes. 


Genomic analysis of oligonucleotide cyclase genes 

For the analysis in Fig. 3, the proteins from the database of 38,167 bacterial 
and archaeal genomes were first clustered using the ‘cluster’ option 
of MMseqs2 (release 2-1c7a89), with default parameters. Clusters were 
further aggregated into larger clusters using four additional cycles of 
clustering, in which, in each cycle, a representative sequence was taken 
from each cluster using the ‘createsubdb’ option of MMseqs2 and 
representative sequences were clustered using the ‘cluster’ option with the 
‘--add-self-matches’ parameter. For the first additional clustering cycle, 
the ‘cluster’ option was run with default parameters; for the additional 
cycles 2–4, clustering was run with sensitivity parameter ‘-s 7.5’, and for 
the additional cycle 4, the ‘--cluster-mode 1’ parameter was also added. 

The sequences of each cluster were aligned using Clustal Omega. 
Each multiple sequence alignment was scanned with HHpred using 60% 
gap rule (-M 60) against the PDB_mmCIF70 and pfam31 databases. 
Clusters with HHpred hits to one of the cGAS entries (Protein Data Bank 
(PDB) codes: 4LEV, 4MKP, 4067, 5VDR, 5V8H, 4LEW, 5VDP, 4KMS, 4068, 
4069, 5V8J, 5V8N, 5V80, 5VDO, 5VDQ, 5VDS, 5VDT, 5VDU, 5VDV, 5VDW, 
4XJ5, 4XJL, 4XJ6, 4XJ3 and 4XJ4, and pfam PF03281) with >90% probability 
in the top 30 hits were taken for manual analysis. Clusters containing 
genes that were suspected to belong to toxin-antitoxin gene pairs were 
discarded. Overall, this procedure identified 30 clusters containing 
6,232 predicted oligonucleotide cyclase genes from 5,150 genomes. 

The genomic environments spanning 10 genes upstream and downstream 
of each of the 6,232 predicted oligonucleotide cyclase genes were 
searched to identify conserved gene cassettes and known defence genes 
around the oligonucleotide cyclase genes, as previously described. 
Predicted systems were manually reviewed and unrelated genes (for 
example, mobilome genes and genes of other defence systems) were 
omitted. 
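
The neighbourhood search described above can be sketched as a fixed window around each cyclase gene. This is a minimal illustration and not the pipeline used in the study: it assumes each contig is available as a list of annotation labels in genomic order, and the function names and toy labels are hypothetical.

```python
from typing import List, Set


def neighbourhood(genes: List[str], index: int, flank: int = 10) -> List[str]:
    """Return up to `flank` genes on each side of genes[index], excluding the focal gene."""
    if not 0 <= index < len(genes):
        raise IndexError("index outside the contig")
    upstream = genes[max(0, index - flank):index]
    downstream = genes[index + 1:index + 1 + flank]
    return upstream + downstream


def near_defence(genes: List[str], index: int,
                 defence_families: Set[str], flank: int = 10) -> bool:
    """True if any gene in the window carries a known defence annotation."""
    return any(g in defence_families for g in neighbourhood(genes, index, flank))


# Toy contig: one made-up annotation label per gene, in genomic order.
contig = ["hyp", "RM", "hyp", "cyclase", "effector", "hyp", "transposase"]
print(near_defence(contig, contig.index("cyclase"), {"RM", "TA", "BREX"}))
```

The window is truncated at contig ends rather than wrapped, which is a design choice of this sketch.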

To generate the phylogenetic tree in Fig. 3a, the ‘clusthash’ option of 
MMseqs2 (release 6-f5a1c) was first used to remove protein redundancies 
(using the ‘--min-seq-id 0.9’ parameter). Sequences shorter than 200 
amino acids were also removed. To further remove outlier sequences, an 
all-versus-all search was conducted using the ‘search’ option in MMseqs2, 
and proteins with fewer than 20 hits were manually examined. Overall, 
17 outlier proteins were removed this way. The human cGAS protein 
(UniProt Q8N884) was added, as well as the human oligoadenylate synthase 
genes (UniProt P00973, P29728 and Q9Y6K5); these were used 
as an outgroup. Sequences were aligned using MAFFT with gap open 
penalty of ‘--op 2’. The FastTree software was used to generate a tree 
from the multiple sequence alignment using default parameters. The 
iTOL software was used for tree visualization. 
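
The two filtering steps that precede tree building (a length cut-off, then flagging of poorly connected proteins for manual review) can be sketched as follows. The data structures and function name are hypothetical; only the thresholds of 200 amino acids and 20 hits come from the text.

```python
from typing import Dict, List, Tuple


def filter_for_tree(seqs: Dict[str, str], hit_counts: Dict[str, int],
                    min_len: int = 200,
                    min_hits: int = 20) -> Tuple[Dict[str, str], List[str]]:
    """Drop short sequences and list candidates for manual outlier review.

    seqs maps protein name to amino-acid sequence; hit_counts maps protein
    name to its number of hits in an all-versus-all search.
    """
    # Step 1: remove sequences shorter than the length cut-off.
    kept = {name: s for name, s in seqs.items() if len(s) >= min_len}
    # Step 2: flag (but do not automatically remove) proteins with few hits.
    flagged = sorted(n for n in kept if hit_counts.get(n, 0) < min_hits)
    return kept, flagged


seqs = {"a": "M" * 250, "b": "M" * 150, "c": "M" * 200}
hits = {"a": 50, "b": 40, "c": 3}
kept, flagged = filter_for_tree(seqs, hits)
print(sorted(kept), flagged)
```

Flagged proteins are returned separately because the text describes manual examination rather than automatic removal.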


Cloning of the CBASS into E. coli MG1655 

The following full-length constructs containing the CBASS were 
designed: the four-gene CBASS from V. cholerae El Tor N16961 (GenBank 
accession NC_002505) was identified as locus tags VC0178–VC0181, 
and the operon was taken together with its upstream and downstream 
intergenic regions, spanning the nucleotide range 178,424–183,957 
in GenBank accession NC_002505. The four-gene system from E. coli 
TW11681 (GenBank accession AELD00000000) was identified as locus 
tags ESGDRAFT_00026–ESGDRAFT_00029 and the operon, together 
with its upstream and downstream intergenic regions, spanned the 
nucleotide range 21,738–27,072 in GenBank accession AELD00000000. 
These two constructs were commercially synthesized and cloned by 
Genscript directly into the plasmid pSG1-rfp between the AscI and 
NotI sites of the multiple cloning site. 

For strains with gene deletions and point mutations, plasmids contain- 
ing systems with these deletions or mutations were also commercially 
synthesized and cloned into the plasmid pSG1-rfp by Genscript, except 
for one construct in which the genes for both the E1-E2 domains and 
JAB domain were deleted (used for Extended Data Figs. 4, 5b). To build 
this construct, the pSG1-rfp backbone was amplified with primers P1-Fw 
+ P1-Rv using KAPA HiFi HotStart ReadyMix (Kapa Biosystems KK2601) 
(Supplementary Table 2). Primers P2-Fw + P2-Rv were used to amplify the gene 
pair capV and dncV with their native promoter from the plasmid that 
contained the full system derived from E. coli TW11681 (Supplementary 
Table 2). After gel purification of the PCR products with Zymoclean Gel 
DNA Recovery Kit (cat. no. D4001), the two fragments were assembled 
using NEBuilder HiFi DNA Assembly cloning kit (NEB E5520S) to produce 
the construct that lacks both gene C (encoding the protein with the E1 
and E2 domains) and gene D (encoding the JAB-domain protein). 

The plasmids were transformed into E. coli MG1655 cells by electroporation, 
and the resulting transformants of the wild-type and mutated 
systems were checked by whole genome sequencing, as previously 
described, to verify system integrity and a lack of mutations. A negative 
control was constructed as a transformant containing an empty 
pSG1 plasmid. 


Cloning of the CBASS from B. cereus into B. subtilis BEST7003 

The two-gene CBASS from B. cereus VD146 (GenBank accession 
KB976672) was identified as locus tags IK1_05630–IK1_05631, and the 
operon was taken together with its upstream and downstream intergenic 
regions, spanning the nucleotide range 60,974–63,493 in GenBank accession 
KB976672. The construct, as well as two additional constructs in 
which one of the two genes was deleted, were commercially synthesized 
and cloned by Genscript directly into the plasmid pSG1-rfp between 
the AscI and NotI sites of the multiple cloning site. The plasmid was 
transformed into B. subtilis and integrated into the amyE locus as previously 
described. A negative control was constructed as a transformant 
containing an empty pSG1 plasmid integrated in the amyE locus. The 
resulting transformants of the wild-type and mutated systems were 
verified by whole genome sequencing as previously described. 


Phage cultivation 

E. coli phages (P1, T4, T5, T7 and λ-vir) were provided by U. Qimron. 
Phages SECphi17, SECphi18 and SECphi27 were isolated in our laboratory. 
T2 and T6 were ordered from the Deutsche Sammlung von 
Mikroorganismen und Zellkulturen (DSMZ) (DSM 16352 and DSM 4622, 
respectively). The following B. subtilis phages were obtained from the 
Bacillus Genetic Stock Center (BGSC): SPO1 (BGSCID 1P4), ϕ3T (BGSCID 
1L1), SPβ (BGSCID 1L5), SPR (BGSCID 1L56), ϕ105 (BGSCID 1L11), ρ14 
(BGSCID 1L15), SPP1 (BGSCID 1P7), SP82G (BGSCID 1P5). Phage ϕ29 was 
obtained from the DSMZ (DSM 5546). Phages SBSphiJ and SBSphiC were 
isolated in our laboratory. Phages were propagated on E. coli MG1655 
or B. subtilis BEST7003 in liquid culture, and their titre was determined 
using the small drop plaque assay method, as previously described. 


Plaque assays 

Bacteria were mixed with MMB agar (LB + 0.1 mM MnCl2 + 5 mM MgCl2 
+ 0.5% agar), and tenfold serial dilutions of the phage lysate in MMB 
were dropped on top of them. After the drops were dry, plates were 
incubated overnight at 37 °C for E. coli phages and at room temperature 
for B. subtilis phages. Plaques were counted to calculate the efficiency 
of plating in plaque-forming units per millilitre. For phages showing 
a fuzzy killing zone in which single plaques could not be counted, the 
lowest phage concentration in which a killing zone was observed was 
counted as ten plaques. Fold defence was calculated as the efficiency 
of plating on control bacteria divided by the efficiency of plating value 
obtained on bacteria containing the CBASS. 
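
The titre and fold-defence calculations described above can be written out as a short sketch. The drop volume is not given in this section, so the 10 μl used in the example is an assumption, as are the function names and the plaque counts.

```python
FUZZY_ZONE_PLAQUES = 10  # per the text: an uncountable killing zone is scored as ten plaques


def pfu_per_ml(plaques: int, fold_dilution: float, drop_volume_ul: float) -> float:
    """Titre in PFU/ml from the plaque count in one drop of diluted lysate.

    fold_dilution is the total dilution of the drop (e.g. 1e6 for 10^-6).
    """
    if drop_volume_ul <= 0 or fold_dilution <= 0:
        raise ValueError("volume and dilution must be positive")
    return plaques * fold_dilution * 1000.0 / drop_volume_ul


def fold_defence(eop_control: float, eop_system: float) -> float:
    """Efficiency of plating on control bacteria divided by that on CBASS-containing bacteria."""
    return eop_control / eop_system


# Made-up counts; drop volume of 10 ul is an assumption.
control = pfu_per_ml(12, 1e6, 10.0)
with_system = pfu_per_ml(FUZZY_ZONE_PLAQUES, 1e3, 10.0)
print(f"{fold_defence(control, with_system):.0f}-fold defence")
```

Because the same drop volume is used for both strains, it cancels in the fold-defence ratio; it matters only for the absolute titres.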


Phage-infection dynamics in liquid medium 

Overnight cultures were diluted 1:100 in MMB medium and incubated 
at 37 °C while shaking at 250 r.p.m. until early log phase (OD600 = 0.3). 
One hundred and eighty microlitres of the diluted culture were transferred 
into wells in a 96-well plate containing 20 μl of phage lysate for 
a final MOI of 2, 0.2 or 0.02 as applicable. Infections were performed 
in triplicate and OD600 was followed using a TECAN Infinite 200 plate 
reader with measurement every 5 min. 


Transformation efficiency assay 

To prepare electro-competent cells, E. coli MG1655 cells (with or without 
the CBASS) were diluted 1:100 in 100 ml LB medium supplemented with 
ampicillin (100 μg/ml). At OD600 = 0.6, the cells were transferred to ice for 
15 min and then centrifuged for 15 min at 4,000 r.p.m. The supernatant 
was discarded and the pellet was resuspended in ultrapure ice-cold 
water. This step was repeated twice and the supernatant was removed. 
The resultant pellet was resuspended in 10% glycerol, centrifuged for 
another 10 min and the supernatant was discarded. The final pellet was 
mixed in 500 μl of 10% glycerol and aliquots of 50 μl were flash-frozen 
in liquid nitrogen and transferred immediately to −80 °C. 

One hundred nanograms of pRSFDuet-1 was added to 50 μl electro-competent 
cells and the mixture was transferred to a Bio-Rad Gene 
Pulser Cuvette (0.2 cm, cat. no. 165-2086). The cells were electroporated 
with a Bio-Rad Micropulser using the ‘Ec1’ setting and then 
immediately transferred to 1 ml LB medium to recover at 37 °C for 1 h. 
After incubation cells were diluted 1:100, and 100 μl were plated on 
LB plates containing ampicillin (100 μg/ml) and kanamycin (50 μg/ml), 
and incubated at 37 °C overnight. Transformation efficiency was 
calculated by dividing the number of transformants that grew on LB 
plates containing ampicillin and kanamycin by the live count grown on 
LB ampicillin (100 μg/ml) only. 
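
The transformation-efficiency ratio can be expressed as a small helper for comparing strains with and without the CBASS. This is a sketch with made-up colony counts; it assumes both counts have already been scaled to the same plated volume and dilution, and the function name is hypothetical.

```python
def transformation_efficiency(transformants: float, live_count: float) -> float:
    """Transformants (ampicillin + kanamycin plates) divided by the live
    count (ampicillin-only plates), both scaled to the same volume."""
    if live_count <= 0:
        raise ValueError("live count must be positive")
    return transformants / live_count


# Made-up example counts for a control strain and a CBASS-containing strain.
eff_control = transformation_efficiency(250, 5.0e6)
eff_system = transformation_efficiency(240, 5.2e6)
print(f"relative efficiency (system/control): {eff_system / eff_control:.2f}")
```

A relative efficiency close to 1 would indicate that the system does not interfere with plasmid transformation.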


Cloning, expression and purification of CapV from E. coli TW11681 

The CapV protein from E. coli TW11681 was PCR-amplified (primers 
P3-Fw + P3-Rv, and P4-Fw + P4-Rv), fused with a C-terminal His tag 
(linker plus His tag sequence: SerGly,His,) and cloned into pET28b. 
The plasmid was transformed into BL21(DE3)pLys cells and grown at 
37 °C, 250 r.p.m., until induction (1 mM IPTG) at an OD600 of 0.6. After 
induction, bacterial cell culture growth was continued for an additional 
14 h at a temperature of 18 °C. Cells were then centrifuged for 10 min 
(3,900g, 4 °C) and pellets kept at −20 °C. Pellets were thawed on ice 
and resuspended with an ice-cold buffer containing 50 mM phosphate 
buffer (pH 7.4), 300 mM NaCl, 10% glycerol (v/v). The buffer was supplemented 
with 10 μg/ml lysozyme, and 1 μl benzonase-nuclease (SKU 
Merck). The suspension was then mixed with Lysing matrix B (MP) 
beads and cells were disrupted mechanically using a FastPrep-24 (MP) 
bead-beater device (2 cycles of 40 s, 6 m s−1). Cell lysate was centrifuged 
at 12,000g for 10 min at 4 °C and the supernatant was then mixed with 
Ni-NTA magnetic agarose beads (Qiagen) for 2 h (4 °C). CapV-6xHis 
proteins bound to Ni-NTA beads were washed 3 times with a 50 mM 
phosphate buffer (pH 7.4), 300 mM NaCl, 10% glycerol (v/v), 20 mM 
imidazole and then eluted with the 50 mM phosphate buffer (pH 7.4), 
300 mM NaCl, 10% glycerol (v/v), 250 mM imidazole. The eluent was 
loaded onto an Amicon Ultra-0.5 centrifugal filter unit 10-kDa filter 
(Merck) to exchange the elution buffer with the reaction buffer (50 
mM phosphate buffer (pH 7.4), 300 mM NaCl, 10% glycerol (v/v)). 
Buffer exchange was done by centrifuging the centrifugal filter unit 
at 14,000g for 10 min at 4 °C 4 times with the reaction buffer. Eluted 
protein purification was assessed by running the sample on an SDS- 
PAGE gel and quantity was measured by using a Qubit Protein Assay 
Kit (Thermo Fisher Scientific). 


Fluorogenic biochemical assay for CapV activity 

The esterase activity of the 6xHis-tagged CapV was probed with the
fluorogenic substrate resorufin butyrate. The 6xHis-tagged CapV
was diluted in 50 mM sodium phosphate (pH 7.4), 300 mM NaCl,
10% (v/v) glycerol to a final concentration of 1.77 µM. To measure
the linear range of CapV activation, purified 6xHis-tagged CapV was
incubated for 5 min with increasing concentrations of 3’3’-cGAMP
(Merck), ranging from 0.04 to 2.5 µM of 3’3’-cGAMP. In parallel,
6xHis-tagged CapV was incubated for 5 min with cell lysates derived from
E. coli cells infected with phage P1 or uninfected. Subsequently, the
enzyme-cGAMP or enzyme-lysate solution was added to DMSO-solubilized
resorufin butyrate (stock of 20 mM mixed with 50 mM
sodium phosphate (pH 7.4), 300 mM NaCl, 10% (v/v) glycerol reaching
a final concentration of 64 µM (final assay DMSO concentration was
0.32%)) to a final assay volume of 50 µl, and fluorescence was measured
in a 96-well plate (Corning 96-well half area black non-treated
plate with a flat bottom). Plates were read once every 30 s for 10 min
at 37 °C using an Infinite-200 (Tecan) with excitation and emission
wavelengths of 550 and 591 nm, respectively. The enzymatic reaction
velocity was measured as previously described. For this, a regression
fit was calculated for the output values of each reaction over time,
and the slope of this linear regression fit was used for determining
the initial reaction velocity (FU s−1).
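
The slope-based velocity estimate described above can be illustrated with a short, dependency-free Python sketch. It assumes an ordinary least-squares fit over the whole read series; the function name, the 21 read times (including a read at t = 0) and the synthetic fluorescence trace are made up for the example and are not data from the study.

```python
from typing import Sequence


def initial_velocity(times_s: Sequence[float], fluorescence: Sequence[float]) -> float:
    """Slope of the least-squares line through (time, fluorescence), in FU per second."""
    n = len(times_s)
    if n != len(fluorescence) or n < 2:
        raise ValueError("need two or more paired time and fluorescence values")
    mean_t = sum(times_s) / n
    mean_f = sum(fluorescence) / n
    # Sum of squared deviations in time and the time-fluorescence cross term.
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (f - mean_f) for t, f in zip(times_s, fluorescence))
    return sxy / sxx


# One read every 30 s for 10 min (assumed here to include a read at t = 0).
times = [30.0 * i for i in range(21)]
# Synthetic trace: 100 FU baseline rising by 2.5 FU per second.
trace = [100.0 + 2.5 * t for t in times]
velocity = initial_velocity(times, trace)
print(f"initial velocity: {velocity:.2f} FU s-1")
```

If a reaction plateaus within the 10 min window, restricting the fit to the early, linear part of the trace would be a reasonable variation on this sketch.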


Cell lysate preparation

E. coli MG1655 cells containing the E. coli TW11681-derived CBASS in
which the capV gene was deleted were used for preparation of cell
lysates. Cells containing an empty vector (pSG1) were used as control.
Cells were grown in 100 ml MMB medium (flask size 250 ml) at 37 °C
(250 r.p.m.) until reaching an OD600 of 0.3. Cells were then infected with
5 ml of phage P1 (titre of 10" infective particles per millilitre, estimated
MOI of 2). After 30 or 40 min from the initial infection, samples were
collected and centrifuged at 3,900g for 5 min at 4 °C. Following
centrifugation, pellets were kept on ice until resuspended in 600 µl buffer
containing 50 mM sodium phosphate (pH 7.4), 300 mM NaCl and 10%
(v/v) glycerol. The resuspended pellet was supplemented with 1 µl
hen-lysozyme (Merck) (final hen-lysozyme concentration of 16 µg/ml). The
resuspended cells were then mixed with Lysing matrix B (MP) beads and
cells were disrupted mechanically using a FastPrep-24 (MP) bead-beater
device (2 cycles of 40 s, 6 m s−1, at 4 °C). Cell lysate was then centrifuged
at 12,000g for 10 min at 4 °C and the supernatant was loaded onto a 3-kDa
filter Amicon Ultra-0.5 centrifugal filter unit (Merck) and centrifuged at
14,000g for 30 min at 4 °C. The flow-through (containing substances
smaller than 3 kDa) was used as the lysate sample for evaluating cGAMP
production within the cells (Fig. 2).


Quantification of 3’3’-cGAMP by high-performance liquid 
chromatography and mass spectrometry (HPLC-MS) 

Cell lysates were prepared as described in ‘Cell lysate preparation’. 
Lysates collected at 40 min after infection were analysed by MS-Omics 
using ultra-performance liquid chromatography (UPLC) (Vanquish, 
Thermo Fisher Scientific) coupled with a high-resolution
quadrupole-orbitrap mass spectrometer (Q Exactive HF Hybrid
Quadrupole-Orbitrap, Thermo Fisher Scientific). An electrospray
ionization interface was used as an ionization source. Analysis was
performed in positive ionization mode. A calibration series of
3’3’-cGAMP (SML1232, Sigma-Aldrich) ranging from 0.001 to 50 µM was
prepared and a linear regression from 0.001 to 5 µM was used for cGAMP
quantification.
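
As a rough illustration of quantification against a calibration series, the sketch below fits a straight line to standards within the 0.001 to 5 µM range and inverts it for an unknown sample. The standard concentrations, the peak areas and every function name are hypothetical; the actual processing performed by MS-Omics is not described in the text and may differ.

```python
from typing import List, Sequence, Tuple

LINEAR_RANGE_UM = (0.001, 5.0)  # range used for the linear regression, per the text


def fit_line(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two or more paired calibration points")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("calibration concentrations must differ")
    slope = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) / sxx
    return slope, mean_y - slope * mean_x


def quantify(peak_area: float, slope: float, intercept: float) -> float:
    """Invert the calibration line to estimate a concentration in µM."""
    return (peak_area - intercept) / slope


# Hypothetical standards (µM) and peak areas; the 50 µM point is made to saturate.
standards_um: List[float] = [0.001, 0.01, 0.1, 1.0, 5.0, 50.0]
peak_areas: List[float] = [52.0, 70.0, 250.0, 2050.0, 10050.0, 60000.0]

# Keep only the standards that fall inside the linear range before fitting.
in_range = [(c, a) for c, a in zip(standards_um, peak_areas)
            if LINEAR_RANGE_UM[0] <= c <= LINEAR_RANGE_UM[1]]
slope, intercept = fit_line([c for c, _ in in_range], [a for _, a in in_range])
sample_um = quantify(2050.0, slope, intercept)
print(f"estimated cGAMP concentration: {sample_um:.3f} µM")
```

Excluding the highest standard mirrors the choice in the text to regress over only part of the prepared series.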




FACS analysis of infected cells 

Overnight cultures of E. coli MG1655 cells containing the E. coli
TW11681-derived CBASS, and E. coli MG1655 cells containing an empty vector
(pSG1), were diluted 1:100 in 0.5 ml MMB and grown at 37 °C and 500
r.p.m. to an OD600 of 0.3. Cells were then infected with phage P1 (titre of
10” infective particles per millilitre, estimated MOI of 2). At 40 min after
infection, 40 µl of each culture were diluted into 2 ml of filtered PBS
containing 1 µl of propidium iodide (Invitrogen LIVE/DEAD BacLight
Bacterial Viability Kit (L7007)). The diluted bacteria were incubated in
the dark at room temperature for five minutes and were then analysed
by a Bio-Rad ZE5 Cell Analyzer. A 5-s agitation was performed and then
25,000 ungated events were recorded for both the CBASS-containing
and CBASS-lacking cultures at 0.2 µl s−1. Forward scatter and propidium
iodide fluorescence were measured using the height (H) parameter.
FlowJo v.10 was used to analyse and visualize the data.


Microscopy of infected cells 

E. coli MG1655 cells that contain the E. coli TW11681-derived CBASS or
the same CBASS with a point mutation of the phospholipase catalytic
site (S60A) were grown in MMB medium at 37 °C. When growth reached
an OD600 of 0.3, bacteria were infected with P1 phage (MOI of about 2).
Five hundred microlitres of the infected samples were centrifuged at
10,000g for 2 min at 25 °C and resuspended in 5 µl of 1× phosphate-buffered
saline (PBS), supplemented with 1 µg/ml membrane stain FM4-64
(Thermo Fisher Scientific T-13320) and 2 µg/ml DNA stain
4,6-diamidino-2-phenylindole (DAPI) (Sigma-Aldrich D9542-5MG). Cells were
visualized and photographed using an Axioplan2 microscope (ZEISS) equipped
with an ORCA Flash 4.0 camera (HAMAMATSU). System control and image
processing were carried out using Zen software version 2.0 (Zeiss).


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 

Extended Data Fig. 1 | Fold antiphage defence conferred by four-gene
defence systems against various phages. The four-gene operon from either
V. cholerae El Tor or E. coli TW11681 was cloned into E. coli MG1655 (Methods).
Fold antiphage defence, as measured by plaque assays, is shown. The fold
defence was calculated as the ratio between the efficiency of plating of the
phage on the operon-lacking control strain and the efficiency of plating on the
operon-containing strain (Fig. 2b, Methods). Bar graph represents average of
three independent replicates, with individual data points overlaid. Points that
fall below the x axis (for SECphi17, SECphi18 and SECphi27) denote values lower
than 1.

Extended Data Fig. 2 | Transformation efficiency assays. Transformation
efficiency of plasmid pRSFDuet-1 into strains that contain the four-gene operon
derived from E. coli TW11681 or from V. cholerae El Tor, presented as a fraction of
the transformation efficiency to E. coli MG1655 carrying an empty vector instead
of the four-gene operon. Bar graph represents average of three independent
replicates, with individual data points overlaid.

Extended Data Fig. 3 | Efficiency of plating of coliphages on defence systems
with whole-gene deletions or point mutations. The efficiency of plating of
phages infecting strains with the wild-type E. coli-derived four-gene system,
deletion strains and strains with point mutations. Data represent plaque-forming
units per millilitre; bar graphs represent average of three independent
replicates, with individual data points overlaid. Empty vector represents a
control E. coli MG1655 strain that lacks the system and has an empty vector
instead. a, Infection with the phage T4. b, Infection with the phage T5.
c, Infection with the phage T6. d, Infection with the phage λ-vir.

Extended Data Fig. 4 | Efficiency of plating of phage P1 on a double-deletion
strain. The efficiency of plating is shown of phage P1 infecting strains with the
wild-type E. coli-derived four-gene system, strains with individual genes deleted
and a strain with two genes deleted. Data represent plaque-forming units per
millilitre; bar graphs represent average of three independent replicates, with
individual data points overlaid. Empty vector represents a control E. coli MG1655
strain that lacks the system and has an empty vector instead.

Extended Data Fig. 5 | The bacterial CBASS functions through abortive
infection. a, Growth curves in liquid culture for CBASS-containing and
CBASS-lacking (empty vector) bacteria infected by phage SECphi18 at 25 °C.
Bacteria were infected at time = 0 at an MOI of 0.02 or 2. Three independent
replicates for each MOI are shown, and each curve shows an individual replicate.
b, Growth curves in liquid culture for cells containing a minimal CBASS
comprising phospholipase-cGAS (capV-dncV) only. Bacteria were infected at
time = 0 at an MOI of 2 by phage P1. Three independent replicates for each MOI
are shown, and each curve shows an individual replicate.

Extended Data Fig. 6 | Cell sorting of infected cells stained with propidium
iodide. Cells containing the CBASS derived from E. coli TW11681, and control
cells containing an empty vector, were stained with propidium iodide, a
fluorescent DNA-binding agent that penetrates cells that have impaired
membrane integrity. Cells were infected by phage P1 (MOI of 2) and sorted on the
basis of propidium-iodide fluorescence intensity (y axis); the x axis represents
forward scatter. a, Uninfected cells that lack the CBASS. b, Uninfected cells that
contain the CBASS. c, Cells that lack the CBASS, 40 min after infection. d, Cells
that contain the CBASS, 40 min after infection. A large population of cells with
high propidium-iodide fluorescence intensity is observed. Data from a
representative replicate of two independent replicates are shown.

Extended Data Fig. 7 | Microscopy of infected cells. a–c, Phase-contrast
images and overlays of membrane stain (red) and DAPI (blue) images,
captured at 20 min (a), 40 min (b) and 60 min (c) after infection with phage P1 at
an MOI of 2. The two columns on the left show E. coli MG1655 cells containing the
CBASS derived from E. coli TW11681. The two columns on the right show E. coli
MG1655 cells containing the CBASS derived from E. coli with a single point
mutation that inactivates the CapV phospholipase (CapV(S60A)). Cell shape is
deformed after 40 min in cells containing the CBASS, but not in cells in which the
CBASS is mutated. After 60 min, phage-mediated cell lysis is observed in cells in
which the CBASS is mutated. Representative images from a single replicate out
of two independent replicates are shown.

Extended Data Fig. 8 | A two-gene CBASS protects Bacillus against phage
infection. a, Domain organization of a two-gene operon found in the B. cereus
VD146 genome. Locus tags of the depicted genes are indicated below each gene.
b, The two-gene operon from B. cereus VD146 was cloned and genomically
integrated into B. subtilis BEST7003, which naturally lacks this system. The
efficiency of plating of phage SBSphiC infecting the CBASS-lacking and
CBASS-containing strains, as well as strains in which one of the two genes was
deleted, is shown. Bar graph represents average of three independent replicates,
with individual data points overlaid. c, Growth curves in liquid culture for
B. subtilis containing the B. cereus two-gene CBASS, or CBASS-lacking B. subtilis
that contains an empty vector instead, infected by phage SBSphiC. Bacteria were
infected at time = 0 at an MOI of 0.2 or 2. Three independent replicates for each
MOI are shown, and each curve shows an individual replicate.

Extended Data Fig. 9 | Domain analysis and homology-based structure
prediction of a bacterial TIR-STING protein. a, Schematics of HHpred
homology-based search results of the Prevotella corporis TIR-STING protein
(Supplementary Table 1). b, Phyre2 secondary structure prediction of the TIR
domain in the P. corporis TIR-STING protein, compared to the solved crystal
structure of the human TIR domain protein MyD88 (PDB accession 2Z5V_A).
c, Phyre2 secondary structure prediction of the STING domain in the P. corporis
TIR-STING protein, compared to the solved crystal structure of the human
STING protein (PDB accession 5BQX_A). Black, identical residues; grey, similar
residues. Secondary structure prediction for the bacterial protein appears
above the alignment; secondary structure of solved human domain appears
below the alignment. d, Structural alignment of human TIR domain protein
MYD88 and the modelled bacterial TIR domain. e, Structural alignment of
human STING domain and the modelled bacterial STING domain. In d, e, blue
and red represent the structure of the human protein and the model of the
bacterial domain structure, respectively.

The ability of the immune system to eliminate and shape the immunogenicity of
tumours defines the process of cancer immunoediting. Immunotherapies such as
those that target immune checkpoint molecules can be used to augment
immune-mediated elimination of tumours and have resulted in durable responses in
patients with cancer that did not respond to previous treatments. However, only a
subset of patients benefit from immunotherapy and more knowledge about what is
required for successful treatment is needed. Although the role of tumour
neoantigen-specific CD8+ T cells in tumour rejection is well established, the
roles of other subsets of T cells have received less attention. Here we show that
spontaneous and immunotherapy-induced anti-tumour responses require the activity
of both tumour-antigen-specific CD8+ and CD4+ T cells, even in tumours that do
not express major histocompatibility complex (MHC) class II molecules. In
addition, the expression of MHC class II-restricted antigens by tumour cells is
required at the site of successful rejection, indicating that activation of CD4+
T cells must also occur in the tumour microenvironment. These findings suggest
that MHC class II-restricted neoantigens have a key function in the anti-tumour
response that is nonoverlapping with that of MHC class I-restricted neoantigens
and therefore needs to be considered when identifying patients who will most
benefit from immunotherapy.


Immune checkpoint therapy (ICT) demonstrates remarkable clinical
efficacy in subsets of patients with cancer, but many patients do not
show durable responses. Although MHC class I (MHC-I)-restricted
neoantigens are important targets of tumour-specific CD8+ cytotoxic T
lymphocytes (CTLs) during successful ICT in both mice and humans,
current methods to predict patient response to ICT are imprecise and
additional or better prognostic indicators are needed. The influence
of MHC class II (MHC-II)-restricted CD4+ T cell responses to tumour
neoantigens during immunotherapy has only recently been addressed.
While some reports show that effective tumour immunity can occur in
the absence of help from CD4+ T cells, most indicate that CD4+ T cells are
important for generating tumour-specific CD8+ T cells. However, as it
has proven difficult to identify tumour-specific mutations that function
as neoantigens for CD4+ T cells using existing MHC-II antigen prediction
algorithms, considerable uncertainty remains as to whether strict
tumour specificity in the CD4+ T cell compartment is required during
spontaneous or ICT-induced anti-tumour responses, especially
for tumours that do not express MHC-II.


In this study, we used the well-characterized, MHC-II-negative T3
methylcholanthrene (MCA)-induced sarcoma line, which grows progressively
in wild-type mice but is rejected following ICT in a CD4+ and CD8+
T cell-dependent manner. Although we have identified point mutations
in laminin-α subunit 4 (LAMA4(G1254V); mLAMA4) and asparagine-linked
glycosylation 8 glucosyltransferase (ALG8(A506T); mALG8) as
major MHC-I neoantigens in T3 cells, the identities of T3-specific MHC-II
antigens remain unknown. Here we use new predictive algorithms to
identify an N710Y somatic point mutation in integrin-β1 (mITGB1) as
a major MHC-II neoantigen of T3 sarcoma cells. In nonimmunogenic
oncogene-driven KP9025 sarcoma cells (KP), which lack mutational
neoantigens, co-expression of single MHC-I and MHC-II T3 neoantigens
rendered KP9025 cells susceptible to ICT. We find similar requirements
for vaccines that drive rejection of T3 tumours. In mice bearing
contralateral KP.mLAMA4.mITGB1 and KP.mLAMA4 tumours, ICT induced
the rejection of tumours expressing both neoantigens but not tumours
expressing mLAMA4 only, indicating that co-expression of both MHC-I
and MHC-II neoantigens at the tumour site is necessary for successful
ICT. These results show that the expression of MHC-II neoantigens in
tumours is a critical determinant of responsiveness to ICT, personalized
cancer vaccines and potentially other immunotherapies.

Fig. 1 | mITGB1 is a major MHC-II-restricted neoantigen of T3 sarcoma cells.
a, hmMHC predictions of MHC-II neoantigens expressed in T3 sarcoma cells.
Potential neoantigens were filtered as shown in Extended Data Fig. 3a and those
that met the strong binder threshold are shown as expression level (fragments
per kilobase of transcript per million mapped reads; FPKM) and neoepitope
ratio (NER). Strong binders are those with −log10Odds < 26.21. Green line,
high-expression cut-off (FPKM = 89.1); blue line, high NER cut-off (NER = 6.55).
b, CD4+ T cells isolated from T3 TILs 12 days after transplantation were
stimulated in IFNγ ELISPOT analysis with naive splenocytes pulsed with
2 µg ml−1 of the indicated individual peptide. Numbers beneath peptides
represent average number of spots from three independent experiments.
c, I-Ab tetramer staining of CD4+ T cells from whole T3 TILs 12 days after
transplantation. Cells were gated on viable CD11b−CD4+ cells. Representative
data from one of three independent experiments are shown. WT, wild-type.
d, Freshly isolated CD4+ T cells from day 12 TILs were stimulated with
2 µg ml−1 mITGB1(710Y) or wild-type ITGB1(710N) peptide-pulsed splenocytes
and analysed by IFNγ ELISPOT. Data are mean ± s.e.m. (n = 3 independent
experiments). *P = 0.03 (unpaired, two-tailed t-test). e, Mirror plot showing
match between MS/MS spectra of the 17-mer peptide encompassing
mITGB1(N710Y) eluted from T3.CIITA cells (right) and a corresponding
synthetic peptide (left). Labelled m/z values reflect those experimentally
observed for the endogenous peptide, with peaks representing b ions in blue
and y ions in red.


Predicting MHC-II neoantigens with hmMHC 


The best currently available methods for predicting MHC-II-restricted
neoantigens rely on tools (netMHCII-2.3 and netMHCIIpan-3.2) that are
inaccurate, partially because the open structure of the MHC-II binding
groove leads to substantial variation in epitope length. Moreover, the
existing tools cannot be re-trained on new data. We therefore developed
a hidden Markov model (HMM)-based MHC binding predictor (hmMHC,
Extended Data Fig. 1a) that inherently accommodates peptide sequences
of variable length and is trained on recent Immune Epitope Database
(IEDB) content (Extended Data Fig. 1b-d). Validation analyses showed
that hmMHC displays substantially higher sensitivity for high-specificity
values than other predictors (Extended Data Fig. 2a, b). Using hmMHC,
we calculated the likelihood of each of the 700 missense mutations that
are expressed in T3 (Supplementary Data 1) being presented by the
MHC-II I-Ab molecule and refined our results by prioritizing candidates
based on I-Ab binding affinity, mutant:wild-type I-Ab binding ratios, and
transcript abundance (Fig. 1a, Extended Data Fig. 3a).
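
The prioritization step can be pictured as a simple threshold filter. The Python sketch below is not hmMHC and does not reproduce its scoring. It only applies the three cut-offs quoted in the Fig. 1a legend (strong binder score, FPKM and NER), with the assumptions that a candidate must pass all three, that the expression and NER cut-offs are inclusive, and that lower scores rank first. The `Candidate` record, its field names and the example values are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, List

# Cut-offs as quoted in the Fig. 1a legend.
STRONG_BINDER_MAX_SCORE = 26.21   # strong binders have -log10(odds) below this value
HIGH_EXPRESSION_MIN_FPKM = 89.1
HIGH_NER_MIN = 6.55


@dataclass(frozen=True)
class Candidate:
    name: str
    neg_log10_odds: float  # binding score; lower is treated here as stronger binding
    fpkm: float            # transcript abundance
    ner: float             # neoepitope ratio (mutant versus wild-type binding)


def meets_all_criteria(c: Candidate) -> bool:
    """True if the candidate passes the binding, expression and NER cut-offs."""
    return (c.neg_log10_odds < STRONG_BINDER_MAX_SCORE
            and c.fpkm >= HIGH_EXPRESSION_MIN_FPKM
            and c.ner >= HIGH_NER_MIN)


def prioritize(candidates: Iterable[Candidate]) -> List[Candidate]:
    """Keep candidates passing every cut-off, strongest predicted binders first."""
    passing = [c for c in candidates if meets_all_criteria(c)]
    return sorted(passing, key=lambda c: c.neg_log10_odds)


# Made-up candidates illustrating each way a peptide can pass or fail.
examples = [
    Candidate("peptide_A", neg_log10_odds=25.0, fpkm=120.0, ner=8.0),   # passes all
    Candidate("peptide_B", neg_log10_odds=25.5, fpkm=40.0, ner=9.0),    # low expression
    Candidate("peptide_C", neg_log10_odds=30.0, fpkm=200.0, ner=10.0),  # weak binder
    Candidate("peptide_D", neg_log10_odds=24.0, fpkm=150.0, ner=3.0),   # low NER
]
selected = [c.name for c in prioritize(examples)]
print(selected)
```

Requiring all three criteria at once matches the text's statement that only candidates meeting every criterion were pursued, but the exact order and form of the filters in the original analysis are given in Extended Data Fig. 3a rather than here.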

One candidate, mITGB1, met all our criteria (Fig. 1a, Extended Data
Fig. 3a). Notably, mITGB1 was not selected using netMHCII-2.3 or
netMHCIIpan-3.2 (Extended Data Fig. 3b, data not shown). Enzyme-linked
immune absorbent spot (ELISPOT) analysis showed that the mITGB1
peptide induced high IFNγ production from CD4+ T3 tumour-infiltrating
lymphocytes (TILs). Other mutant peptides that fulfilled some but not
all of our criteria induced only weak or absent responses, thereby
validating our hmMHC prediction method (Fig. 1b, Extended Data Fig. 3c,
Supplementary Table 1). To confirm this result, we stained T3-derived
CD4+ TILs with MHC-II tetramers carrying either the 707-721 mITGB1
peptide or an irrelevant peptide (CLIP). Whereas 5.9% of T3-infiltrating
CD4+ T cells stained positively with the mITGB1-I-Ab tetramer, the
CLIP-I-Ab tetramer stained only 0.7% of the cells (Fig. 1c, Extended Data
Fig. 3d, e). Cytokine profiling of mITGB1-specific CD4+ TILs from T3 tumours
revealed that they produced IFNγ, TNF, and IL-2 but not IL-4, IL-10, IL-17
or IL-22, indicating a phenotype resembling that of T helper type 1 (TH1)
cells (Extended Data Fig. 3f). T3 tumour-bearing mice treated with ICT
did not develop additional MHC-II neoantigen specificities (data not
shown). To assess whether T3-specific CD4+ T cells selectively recognized
the mutant, we compared mutant to wild-type ITGB1 peptides in
ELISPOT analyses using freshly isolated T3 CD4+ TILs. Only the mITGB1
peptide induced positive responses (Fig. 1d). Similar data were obtained
using CD4+ T cell hybridomas generated from T3 TILs (Extended Data
Figs. 4, 5a).

Mapping experiments revealed that the MHC-II binding core of
mITGB1 consists of nine amino acids (710YNEAIVHVV718), in which the
mutant Y710 residue functions as an I-Ab anchor (Extended Data Fig. 5b).
To verify that the mITGB1 epitope is physiologically presented by MHC-II,
we transduced T3 cells with a vector encoding the mouse MHC-II
transactivator CIITA (T3.CIITA cells), which induced high levels of I-Ab
expression (Extended Data Fig. 5c). Elution of peptides bound to I-Ab
on T3.CIITA cells and analysis by mass spectrometry identified two
mITGB1 peptides encompassing the Y710 mutation (a 17-mer and a
14-mer; Fig. 1e, Extended Data Fig. 5d). Peptides with the corresponding
wild-type sequence were not found. The mITGB1 epitope was also not
detected in MHC-I eluates from IFNγ-stimulated T3 cells, and
mITGB1-specific CD8+ T cells were not observed by cytokine production (data
not shown). Together, these data demonstrate that mITGB1 is a major
MHC-II-restricted neoantigen of T3 sarcoma cells.


ICT response requires CD4+ T cell help


Recent publications have highlighted the ability of CD4+ T cells to
recognize tumour-specific antigens and promote tumour rejection in the
absence of ICT. To assess whether CD4+ T cells are required during
ICT-induced rejection, we expressed MHC-I and/or MHC-II neoantigens
from T3 sarcoma cells in an oncogene-driven sarcoma cell line generated
from a KrasLSL-G12D/+ × Tp53fl/fl mouse injected intramuscularly with
lentiviral Cre-recombinase (KP9025 cells). The unmodified KP9025
sarcoma line formed progressively growing tumours in either syngeneic
wild-type mice treated with or without dual anti-PD-1 and anti-CTLA4
ICT or mice rechallenged with unmodified KP9025 after previously
being cured of their KP9025 tumours via surgical resection (Fig. 2a, b).
As this challenge-resection-rechallenge approach promotes immune
control or rejection of even poorly immunogenic tumour cells used in
the initial priming step, these results supported the conclusion that
KP9025 sarcoma cells were not immunogenic. Whole-exome sequencing
revealed that KP9025 cells expressed only four nonsynonymous
mutations (Supplementary Data 2) and none were predicted to be
immunogenic (Extended Data Fig. 6a, b, Supplementary Table 2). Enforced
expression of either mLAMA4 or mITGB1 alone did not render KP9025
cells immunogenic in wild-type mice in the presence or absence of ICT
(Fig. 2c, Extended Data Fig. 6d, e). Progressively growing KP.mLAMA4
tumours maintained expression of their MHC-I tumour neoantigen,
thereby ruling out antigen loss via immunoediting (Extended Data
Fig. 7a). KP9025 cells expressing both mLAMA4 and mITGB1 formed
tumours in immunodeficient Rag2−/− mice that grew with kinetics similar
to those of KP.mLAMA4 or KP.mITGB1 cells (Extended Data Fig. 6c).
However, growth of KP.mLAMA4.mITGB1 cells in wild-type mice treated
with a control monoclonal antibody was noticeably slower than that
of either single-antigen-expressing cell line, and KP.mLAMA4.mITGB1
tumours were rejected in wild-type mice following either dual or single
agent ICT despite the absence of tumour cell MHC-II expression (Fig. 2c,
Extended Data Fig. 6d, e, data not shown).

Fig. 2 | ICT-mediated rejection of a nonimmunogenic sarcoma requires CD4+
and CD8+ T cells. a, One million KP9025 sarcoma cells were injected
subcutaneously into syngeneic 129S4 mice and animals were treated with either
a control monoclonal antibody or the anti-PD-1 + anti-CTLA4 combination on
days 3, 6, and 9 after transplantation. Representative data from two
independent experiments are shown as mean tumour diameter ± s.e.m. (n = 5 in
all groups per experiment). b, KP9025 sarcoma cells were injected as above and
tumours were surgically resected followed by rechallenge with the same line.
Representative data from one of two independent experiments are shown as
mean tumour diameter ± s.e.m. (n = 3 in all groups per experiment). c, Cohorts of
five mice were injected with 1 × 10^6 KP.mLAMA4, KP.mITGB1,
KP.mLAMA4.mITGB1, or KP.mSB2.SIINFEKL cells and treated with either control
monoclonal antibody (top) or the anti-PD-1 + anti-CTLA4 combination (bottom)
on days 3, 6, and 9 after transplantation. Representative data from one of three
independent experiments are shown as individual tumour diameters.

We considered the possibility that the enhanced immunogenicity of
KP.mLAMA4.mITGB1 tumours was merely a function of antigen quantity.
Therefore, we generated KP9025 cells that lacked MHC-II neoantigens
but co-expressed two strong MHC-I neoantigens: the MHC-I epitope
of ovalbumin (SIINFEKL) and the R913L mutant of spectrin-β2 (mSB2),
which contributes to the spontaneous rejection of the MCA-induced
d42m1 sarcoma line in wild-type mice. KP.mSB2.SIINFEKL tumours grew
progressively in mice treated either with a control monoclonal antibody
or dual ICT, and the expression of both MHC-I antigens was maintained
in growing tumours from ICT-treated animals (Fig. 2c, Extended Data
Fig. 7b-d). Enforced expression of mITGB1 in KP.mSB2.SIINFEKL cells
led to significantly (P=1.5 x 10°) increased survival of ICT-treated mice
injected with the uncloned tumour line (Extended Data Fig. 7e). Thus,
tumour rejection and ICT sensitivity are dependent on combinatorial
effects of CD4+ and CD8+ T cells.


mITGB1 CD4+ T cells are TH1 polarized


We next investigated whether mITGB1-specific CD4+ TILs displayed a
TH1 phenotype similar to that seen with T3 tumours. Seventy-four per
cent of mITGB1 tetramer-positive CD4+ T cells in KP.mLAMA4.mITGB1
tumours from control-treated mice expressed the TH1-associated
transcription factor T-BET, but not the regulatory T cell (Treg)-associated
transcription factor FOXP3. An additional 17% expressed both T-BET and
FOXP3. Conversely, tetramer-negative CD4+ T cells showed substantially
diminished expression of T-BET (24%) and much higher expression of
FOXP3 (61%). mITGB1-tetramer+ CD4+ T cells displayed a
higher T-BET+:FOXP3+ ratio than tetramer-negative cells (4 versus 0.4,
respectively) and this ratio was further increased in response to
anti-CTLA4 treatment (33 versus 3.7, respectively; Extended Data Fig. 8a–c).
On average, 83% of mITGB1-specific CD4+ T cells expressed high levels of
PD-1 compared to only 19% of mITGB1-tetramer-negative cells (Extended
Data Fig. 8d, e). CD4+ T cells specific for mITGB1 also expressed high
levels of CD44, ICOS and CD150 (also known as SLAM), and low levels
of KLRG1 (Extended Data Fig. 8f). The presence of an expanded population
of TH1-like ICOS+ CD4+ T cells was recently reported in mice bearing
B16 or MC38 tumours that were treated with anti-CTLA4, although the
tumour antigen specificity of this population was not identified. These
data, together with the cytokine profiles described above, indicate that
mITGB1-specific CD4+ T cells display an activated TH1 phenotype.


CTL generation requires CD4+ T cell help


To identify the mechanism by which tumour neoantigen-specific CD4+
T cells influence ICT-mediated anti-tumour responses, we assessed their
effects on CD8+ T cell priming by comparing MHC-I tetramer staining of
splenic mLAMA4-specific CD8+ T cells from mice bearing KP.mLAMA4
or KP.mLAMA4.mITGB1 tumours, treated with a control monoclonal
antibody or ICT. In the absence of ICT, mLAMA4-H-2Kb tetramers stained
only 1.2% of CD8+ T cells from mice bearing KP.mLAMA4 tumours, but 5.3%
of CD8+ T cells in mice bearing KP.mLAMA4.mITGB1 tumours (Fig. 3a,
b). This staining percentage was unchanged in the presence of PD-1
blockade, but was increased by anti-CTLA4 treatment, either as
monotherapy or in combination with anti-PD-1. This result is consistent with
the observation that anti-CTLA4 treatment functions largely to enhance
CD4+ T cell responses.

To assess whether MHC-II neoantigens also enhanced CTL formation,
we used an in vivo T cell cytotoxicity assay that monitored the capacity
of naturally arising CTLs to kill peptide-pulsed splenocytes labelled with
carboxyfluorescein succinimidyl ester (CFSE). Non-tumour-bearing
control mice and mice bearing KP.mLAMA4 tumours were largely incapable
of eliminating mLAMA4 peptide-pulsed splenocytes in either
the presence or absence of ICT (Fig. 3c). By contrast, mice bearing
KP.mLAMA4.mITGB1 tumours efficiently eliminated CFSEhi-labelled,
mLAMA4 peptide-pulsed splenocytes but not CFSElo-labelled
SIINFEKL-pulsed splenocytes, and the degree of elimination of the former was
enhanced by ICT (Fig. 3c, d). The cytotoxic activity of control-treated
mLAMA4-specific CD8+ T cells observed in the splenocyte killing assay
was higher than would be expected from our in vivo tumour rejection
experiments (Fig. 2c). This difference is likely to reflect differences in
the susceptibility of splenocytes and tumour cells to T cell-mediated
killing. Thus, CD4+ T cell help enhances both CD8+ T cell priming and
maturation of CD8+ T cells into CTLs.


Vaccines require MHC-I and MHC-II antigens 


As CD4+ T cell help was crucial for generating mLAMA4-specific CTLs
during ICT, we tested whether mITGB1-specific CD4+ T cells were


[Fig. 3 graphic. a, Flow cytometry plots of mLAMA4-H-2Kb-APC tetramer staining for mice bearing KP.mLAMA4 or KP.mLAMA4.mITGB1 tumours, treated with control antibody, anti-PD-1, anti-CTLA4, or anti-PD-1 + anti-CTLA4. b, Quantification of the staining in a. c, CFSE histograms (SIINFEKL, CFSElo; mLAMA4, CFSEhi) for naive mice and for mice bearing KP.mLAMA4 or KP.mLAMA4.mITGB1 tumours under the same treatments. d, Quantification of mLAMA4-specific lysis.]


Fig. 3 | CD4+ T cell help is required for the generation of functional CD8+ CTLs
during ICT. a, Representative tetramer staining of mLAMA4-specific CD8+
T cells from the spleens of mice bearing KP.mLAMA4 (left) or
KP.mLAMA4.mITGB1 (right) tumours 12 days after transplantation. Mice received the
indicated ICT treatment on days 3, 6, and 9. Cells were gated from viable
CD45+CD11b−Thy1.2+ cells. b, Quantification of three independent experiments
described in a, shown as mean ± s.e.m. per cent mLAMA4 tetramer-positive CD8+
T cells. *P = 0.04, ***P = 0.0007, ****P = 0.00003 (two-way ANOVA with multiple
comparisons corrected with the Bonferroni method). c, In vivo cytotoxic
function of mLAMA4-specific CD8+ T cells. Naive splenocytes were labelled with


also important for vaccine-elicited anti-tumour responses (Fig. 4a).
Vaccination of naive recipient mice with irradiated parental KP9025,
KP.mLAMA4, or KP.mITGB1 cells was not sufficient to protect most mice
from a subsequent challenge with T3 sarcoma cells. Vaccination with a
mixture of irradiated KP.mLAMA4 and KP.mITGB1 cells provided
protection against T3 challenge in 30% of mice. By contrast, vaccination with
irradiated KP.mLAMA4.mITGB1 cells prevented T3 tumour outgrowth
in 11 of 13 recipients (Fig. 4b, c). Furthermore, spleens from mice
vaccinated with irradiated KP.mLAMA4.mITGB1 cells contained significantly
(P = 0.0002) more mLAMA4-specific, IFNγ-producing CD8+ T cells than
did the spleens of mice vaccinated with KP cells expressing only mLAMA4
(Fig. 4d). The differences in efficacy between mixed cellular vaccines
and dual antigen-expressing KP.mLAMA4.mITGB1 vaccines support
previous findings that effective vaccines are those in which the MHC-I
and MHC-II epitopes reside on the same peptide strand, potentially
leading to more efficient uptake and presentation of both antigens by
the same antigen-presenting cell (APC). A similar situation would be
expected to occur when both antigens were present in the same tumour
cell used for vaccination.


MHC-II antigen expression at tumour site 

To investigate whether CD4+ T cells are required beyond the priming
and maturation of anti-tumour CTLs, we tested whether tumour cell
expression of MHC-II neoantigens was necessary at the site of tumour
rejection. We assessed the in vivo growth of contralaterally injected
KP.mLAMA4.mITGB1 and KP.mLAMA4 tumours in either
immunodeficient or immunocompetent mice treated with ICT. The
contralateral tumours grew at equivalent rates in Rag2−/− mice (Extended Data
Fig. 9a). However, ICT treatment of wild-type mice bearing contralateral
tumours resulted in complete rejection of the KP.mLAMA4.mITGB1
tumour but only delayed outgrowth of the KP.mLAMA4 tumour on


0.5 µM CFSE and pulsed with 1 µM SIINFEKL peptide (white histograms) or
labelled with 5 µM CFSE and pulsed with 1 µM mLAMA4 peptide (green
histograms) and transferred into control naive or tumour-bearing mice 11 days
after tumour transplantation. Tumour-bearing mice received the indicated ICT
treatment on days 3, 6, and 9 after transplantation. Representative data from six
(anti-CTLA4) and eight (all other groups) independent experiments are shown.
d, Quantification of per cent mLAMA4-specific lysis from independent in vivo
cytotoxicity assays in c shown as mean ± s.e.m. (n = 6 in anti-CTLA4, n = 8 in all
other groups). P values calculated using two-way ANOVA with multiple
comparisons and Bonferroni correction.


[Fig. 4 graphic. a, Vaccination schematic: lethally irradiated tumour vaccine, T3 MCA sarcoma challenge 10 days later, then growth or rejection. b, Tumour diameter (mm) versus days post transplant for each vaccine group. c, Survival versus days post transplant for each vaccine group. d, ELISPOT spot counts for OVA and mLAMA4 peptides after vaccination with KP.mLAMA4 or KP.mLAMA4.mITGB1 cells.]


Fig. 4 | MHC-II neoantigens are required for optimal tumour vaccine
efficacy. a, Schematic of tumour vaccine strategy. Naive syngeneic 129S6 mice
were vaccinated with 5 × 10^5 lethally irradiated KP sarcoma cells expressing the
indicated antigens. Ten days after vaccination, mice were injected with 2 × 10^6 T3
sarcoma cells on the opposite flank and growth or rejection of T3 tumours was
monitored. b, Growth curves of T3 sarcoma cells in vaccinated mice as in a. Data
are individual tumour diameters from mice injected in three independent
experiments (numbers in figure represent number rejected over total number n
for each group). c, Kaplan-Meier curves showing survival of mice in b. P values
calculated using Mantel-Cox test. d, ELISPOT analysis of 1 µM peptide-pulsed
splenocytes 10 days after vaccination of naive mice with irradiated KP.mLAMA4
or KP.mLAMA4.mITGB1 cells as in a. Data from three independent experiments
are shown as mean ± s.e.m. number of spots. ***P = 0.0002 (unpaired, two-tailed
t-test).
Fig. 5 | Expression of an MHC-II neoantigen by tumour cells has localized
impact on tumour composition. a, Wild-type syngeneic 129S4 mice were
injected with 1 × 10^6 KP.mLAMA4 cells followed by treatment with
anti-PD-1 + anti-CTLA4 on days 3, 6, and 9 after transplantation. Representative data
from one of three individual experiments are shown as individual tumour
diameters (n = 5 per group per experiment). b, Mice were injected on opposite
sides with 1 × 10^6 KP.mLAMA4 cells and 1 × 10^6 KP.mLAMA4.mITGB1 cells followed
by treatment as in a. Representative data from one of three individual
experiments are shown as individual tumour diameters (n = 5 per group per
experiment). c, Mice were injected as in b and IFNγ ELISPOT analysis of
tumour-infiltrating CD4+ T cells stimulated with naive splenocytes pulsed with 2 µg ml^-1
of the indicated peptides was performed 11 days after transplantation. Numbers
beneath images indicate the average number of spots in mITGB1-stimulated
wells from three independent experiments. d, Tetramer staining of
mLAMA4-specific CD8+ TILs 11 days after transplantation of mice in b. Representative data
from one of four independent experiments are shown as per cent of
mLAMA4-specific cells within the CD8+ T cell population. e, Quantification of
tumour-infiltrating T cells from mice in b 11 days after transplantation. Data are shown as
per cent of total viable CD45+ cells ± s.e.m. *P = 0.02, **P = 0.009 (unpaired,
two-tailed t-test).


the opposite flank (Fig. 5a, b). This result shows that CTLs specific for
mLAMA4 can control tumours expressing both the cognate MHC-I
epitope and the helper MHC-II epitope locally, but function poorly
against distant but related tumours that lack CD4 neoepitopes. In
similar experiments, we investigated whether mITGB1-specific CD4+
T cells generated from KP.mLAMA4.mITGB1 tumours were sufficient
to control the outgrowth of KP.mITGB1 tumours on the opposite flank.
In this setting, contralateral KP.mITGB1 tumour growth was
identical to that observed in mice bearing only a single KP.mITGB1 tumour
(Extended Data Fig. 9b, c). Together, these results show that tumour
cell expression of MHC-II-restricted neoantigens and the presence
of tumour-specific CD4+ T cells in the tumour microenvironment are
required to maintain tumour control during ICT but are not sufficient
to mediate tumour rejection by themselves.

To expand this observation, we investigated whether CD4+ T cells
and expression of MHC-II neoantigens in tumour cells are required to
maintain functional CD8+ T cell memory. When mice that had been
cured of T3 tumours by ICT treatment were rechallenged with T3 tumour
cells, they rejected the cells. However, if mice were depleted of CD4+
T cells before being rechallenged, they did not control T3 tumour
outgrowth (Extended Data Fig. 9d). In parallel experiments, mice previously
cured of KP.mLAMA4.mITGB1 tumours by surgical resection were
protected against subsequent rechallenge with KP.mLAMA4.mITGB1 but
were unable to prevent outgrowth of KP.mLAMA4 or KP9025 tumours
(Extended Data Fig. 9e). Thus, both expression of MHC-II neoantigens
by tumour cells and CD4+ T cell help are required for the maintenance
of tumour-specific immunologic memory.

Last, we investigated whether an MHC-II tumour neoantigen can
significantly affect the local tumour microenvironment (gating strategy,
Extended Data Fig. 10a). The expression of inducible nitric oxide
synthase (iNOS) is higher in macrophages that populate tumours destined
to be rejected after ICT than in macrophages from progressively growing
tumours, and this expression is induced by ICT-dependent production of
IFNγ. iNOS+ macrophages were present at threefold higher levels in
ICT-treated KP.mLAMA4.mITGB1 tumours than in contralateral KP.mLAMA4
tumours (Extended Data Fig. 9g, h). ELISPOT analysis of
tumour-infiltrating CD4+ T cells showed 5.9-fold more IFNγ+ mITGB1-specific CD4+ T cells
in KP.mLAMA4.mITGB1 tumours than in contralateral KP.mLAMA4
tumours (Fig. 5c, Extended Data Fig. 9f). Flow cytometry analysis of
the lymphoid compartment (gating strategy, Extended Data Fig. 10b)
identified 3.7-fold more CD8+ T cells and 9-fold more
mLAMA4-specific CD8+ T cells in KP.mLAMA4.mITGB1 tumours than in KP.mLAMA4
tumours (Fig. 5d, e). We then investigated whether CD4+ T cells were
sufficient to mediate these changes, by comparing iNOS+ macrophages
in KP.mLAMA4.mITGB1 tumours with those in contralateral KP.mITGB1
tumours. KP.mLAMA4.mITGB1 tumours contained 83-fold more iNOS+
macrophages than did KP.mITGB1 tumours (Extended Data Fig. 9i, j).
Together, these data show that MHC-II-restricted anti-tumour responses
are necessary but not sufficient in ICT-sensitive tumour models to induce
localized effects on the immune composition of tumours.


Discussion 

The work described herein focuses on the functional role of
MHC-II-restricted tumour neoantigens in mediating ICT-dependent anti-tumour
responses in a well-characterized mouse sarcoma model. Using an
HMM-based tool (hmMHC), we have predicted and validated that an N710Y
point mutation in the integrin ITGB1 forms a major MHC-II-restricted
neoepitope of the T3 MCA sarcoma. It is reasonable that mITGB1
represents a major MHC-II neoantigen of T3 tumour cells because ITGB1 is the
second most highly expressed mutation in T3 tumour cells and the point
mutation in mITGB1 generates a novel anchor residue that promotes
high-affinity binding to I-Ab. Moreover, others have proposed that secreted
tumour proteins are favoured targets for CD4+ T cell responses because
they are more easily taken up by professional APCs. Localization of
mITGB1 on the cell membrane would also be likely to facilitate efficient
access by APCs, although we did not directly test this idea. Notably, we do
not rule out the possibility that T3 cells express other MHC-II-restricted
epitopes that might be elicited by vaccination. Nevertheless, we have
shown that mITGB1 functions as a major neoantigen of T3 cells during
naturally occurring anti-tumour responses.

By defining authentic MHC-I and MHC-II neoantigens of T3 sarcoma
cells, we have shown that, in a minimal antigen system, a single clonally
expressed MHC-I neoantigen (mLAMA4) and a single clonally expressed
MHC-II neoantigen (mITGB1) are necessary and sufficient to render
nonimmunogenic, oncogene-driven KP9025 sarcoma cells sensitive to
ICT. Using KP9025 sarcoma cells that express different combinations of
mLAMA4 and/or mITGB1, we have shown that CD4+ T cell responses are
required for optimal priming of MHC-I-restricted CD8+ T cells and their
maturation into CTLs, in either the presence or absence of ICT. We have
also shown that optimal anti-tumour responses occur when tumour cells
express both MHC-I and MHC-II neoantigens. In part, this requirement
reflects the potential need for CD4+ T cell responses in the tumour
microenvironment and, from previous work, appears to be at least partially
due to production of IFNγ by tumour-specific CD4+ T cells. We find it
of particular interest that the generation of effective tumour immunity
requires MHC-II neoantigens following either vaccination with
tumour-specific neoantigen vaccines or ICT. These results provide new insights
into the role of MHC-II neoantigens in natural and therapeutic immune
responses to tumours. They also suggest that patients with tumours
that are predicted to contain immunogenic MHC-I neoantigens or have
favourable tumour mutational burdens could still be unresponsive
to immunotherapies, owing to the absence of immunogenic
MHC-II-restricted CD4+ T cell antigens. This possibility has not been critically
evaluated yet, owing to the past absence of reliable MHC-II prediction
algorithms. Future work is needed to test this hypothesis in patients
with cancer undergoing immunotherapy.

Note added in proof: As this Article was being prepared for publication, 
an independent paper was published online describing an MHC-II 
prediction algorithm for human tumours.
Methods


Mice 

Male wild-type 129S6 mice (for experiments involving T3 cells) were 
purchased from Taconic Farms. Male wild-type 129S4 mice (for
experiments involving KP9025 cells) and 129S6 Rag2−/− mice were bred in our
specific-pathogen free facility. All in vivo experiments were performed
in our specific-pathogen free facility and used mice between the ages 
of 8 and 12 weeks. All experiments were performed in accordance with 
procedures approved by the AAALAC-accredited Animal Studies Com- 
mittee of Washington University in St Louis and were in compliance with 
all relevant ethical regulations. 


Tumour transplantation 

T3 MCA-induced sarcoma cells were previously generated in our labo- 
ratory in 129S6 wild-type mice. KP sarcoma cell lines were provided 
by T.J., and were generated following intramuscular injection of
lentiviral Cre-recombinase into 129S4 Kras^LSL-G12D/+ × Tp53^fl/fl mice. Tumour
cells were cultured in Roswell Park Memorial Institute (RPMI) medium
(Hyclone) supplemented with 10% fetal calf serum (FCS) (Hyclone). Cell
lines were authenticated using whole-exome sequencing and
verification of specific antigen expression. All cell lines used tested negative
for mycoplasma contamination. For transplantation, cells were washed
extensively in PBS and resuspended at a density of 13.34 × 10^6 cells per
ml (T3) or 6.67 × 10^6 cells per ml (KP sarcomas) in PBS. Then, 150 µl was
injected subcutaneously into the rear flanks of syngeneic recipient
mice. For irradiated tumour cell vaccines, KP.mLAMA4, KP.mITGB1 or
KP.mLAMA4.mITGB1 sarcoma cells were lethally irradiated with 10 Gy
and 500,000 cells were injected subcutaneously into 129S6 mice. T3
challenge following vaccination occurred on the opposite flank. Fol- 
lowing tumour transplantation, animals were randomly assigned to 
treatment groups. No statistical methods were used to determine group 
size. Tumour growth was measured by calipers and individual growth 
curves are represented as the average of two perpendicular diameters. 
Tumour measurements were performed blinded to treatment group. 
In accordance with our IACUC-approved protocol, maximal tumour 
diameter was 20 mm in one direction, and in no experiments was this 
limit exceeded. 
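
As a quick arithmetic check on the transplantation doses, the number of cells delivered per injection is the suspension density multiplied by the injected volume. The sketch below assumes the densities read 13.34 × 10^6 (T3) and 6.67 × 10^6 (KP) cells per ml and that the volume is 150 µl; the function and constant names are illustrative only.

```python
# Cells per injection = suspension density (cells/ml) x injected volume (ml).
# Assumption: densities are 13.34e6 (T3) and 6.67e6 (KP) cells per ml.
INJECTION_VOLUME_ML = 0.150  # 150 µl per subcutaneous injection


def cells_per_injection(density_per_ml, volume_ml=INJECTION_VOLUME_ML):
    """Return the number of cells contained in one injected volume."""
    return density_per_ml * volume_ml


t3_dose = cells_per_injection(13.34e6)  # T3 MCA sarcoma
kp_dose = cells_per_injection(6.67e6)   # KP sarcoma lines

print(f"T3: {t3_dose:,.0f} cells per injection")
print(f"KP: {kp_dose:,.0f} cells per injection")
```

Under these assumptions the doses come to roughly two million T3 cells and one million KP cells per mouse.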


Tumour rechallenge 

For tumour rechallenge following surgical resection, primary tumours 
were allowed to grow until they reached 10 mm in size or to the time 
point indicated. Following surgical removal of the established tumour, 
animals were rested for 30 days. Animals were then rechallenged on 
the opposite flank with either the same tumour line as was used in the 
primary tumour challenge or the tumour line indicated. For tumour 
rechallenge following ICT-mediated rejection, primary tumours were 
rejected following treatment with combination anti-PD-1 and
anti-CTLA4 ICT. After tumours were no longer apparent, animals were rested
for 30 days followed by rechallenge on the opposite flank with the same 
tumour line as was used in the primary challenge or the tumour line 
indicated. 


Epitope prediction 

The identification of point mutations in T3 and KP sarcomas and the
prediction of MHC-I epitopes in KP and F244 sarcomas were performed
as previously described. To predict neoepitopes, we applied hmMHC,
our newly developed HMM-based binding predictor, trained on the most
recent IEDB data. HMMs inherently accommodate inputs of variable
length and have already demonstrated reasonable performance for
prediction of MHC binding affinity. Our predictor uses a fully
connected HMM with emissions representing amino acids (see a
pedagogical example in Extended Data Fig. 1a). We trained the model on a set of
known binders using the Baum-Welch algorithm, as implemented
by the GHMM library (http://ghmm.sourceforge.net/). A trained HMM
returns the likelihood of a peptide to be a binder, which we represent
as the −log10 odds ratio, where a smaller value indicates that a peptide
has a higher likelihood of being a binder. The model that we apply in
this study was trained on murine H2-I-Ab binders taken from the IEDB
full MHC ligand export (downloaded on 25 November 2018,
containing 1,072,460 entries). Non-binders were not used in model training.
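
To make the scoring idea concrete, the following is a minimal sketch of how a fully connected HMM with amino-acid emissions assigns a likelihood to a peptide of any length, using the forward algorithm. It is not the hmMHC code (which relies on GHMM): the two-state model, its parameters, and the choice of a uniform amino-acid background as the denominator of the odds ratio are all assumptions made for illustration, and no training step is shown.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def forward_log10_likelihood(peptide, start, trans, emit):
    """Return log10 P(peptide | HMM) using the scaled forward algorithm."""
    n_states = len(start)
    # alpha[j]: probability of the residues seen so far with the chain in state j.
    alpha = [start[j] * emit[j][peptide[0]] for j in range(n_states)]
    log10_likelihood = 0.0
    for residue in peptide[1:]:
        # Rescale before each step so that long peptides do not underflow.
        scale = sum(alpha)
        log10_likelihood += math.log10(scale)
        alpha = [a / scale for a in alpha]
        alpha = [
            sum(alpha[i] * trans[i][j] for i in range(n_states)) * emit[j][residue]
            for j in range(n_states)
        ]
    return log10_likelihood + math.log10(sum(alpha))


def neg_log10_odds(peptide, start, trans, emit):
    """Score = -log10[P(peptide | HMM) / P(peptide | uniform background)].

    The uniform background is an assumption of this sketch; smaller is better.
    """
    background = len(peptide) * math.log10(1.0 / len(AMINO_ACIDS))
    return -(forward_log10_likelihood(peptide, start, trans, emit) - background)


# Toy two-state model with hypothetical parameters (not trained on IEDB data).
uniform = {aa: 1.0 / 20 for aa in AMINO_ACIDS}
aromatic_rich = {aa: (0.24 if aa in "FWY" else 0.28 / 17) for aa in AMINO_ACIDS}
start = [0.5, 0.5]
trans = [[0.7, 0.3], [0.3, 0.7]]
emit = [uniform, aromatic_rich]

for toy_peptide in ("YYFWYY", "AAGGSS"):
    print(toy_peptide, round(neg_log10_odds(toy_peptide, start, trans, emit), 3))
```

Because the forward recursion sums over all state paths one residue at a time, the same model scores 12-mers and 24-mers without padding or truncation, which is the property that makes HMMs convenient for MHC-II ligands.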
The categorization of the data into binders and non-binders was based
on the qualitative and quantitative fields of IEDB entries: binders are
peptides with IC50 < 500 nM or with positive, positive-high or
positive-intermediate binding quality. These data came largely from mass
spectrometry assays.

We validated the model using the Monte Carlo
(shuffle-split) cross-validation approach, with ten random partitions of
H2-I-Ab binders from IEDB into training and validation sets, with a
relative validation set size of 0.2. As the number of non-binders in the IEDB
dataset was insufficient for validation, we used decoy sets composed
of random natural peptides as non-binders. Protein-coding transcript
translation sequences for Mus musculus were obtained from GENCODE
release M19 (GENCODE project, 2018); there are 65,257 translations. For
every cross-validation partition, the translations were randomly cut into
fragments with lengths uniformly distributed in the interval [12, 24], which
generated about 1.5 × 10^6 fragments. Of this set of random natural peptides,
a random sample 100 times the number of binders in the validation set
was taken. The 100-fold bias in the number of generated non-binders
and uniform distribution of their lengths are in line with recent work on
MHC binding prediction, in particular netMHCpan-4.0. We have also
performed experiments in which the distribution of random natural
peptide lengths followed the distribution of lengths in the IEDB dataset
(Extended Data Fig. 1d) and found no significant difference in results in
our setting compared to uniform distribution. The rationale for the
100-fold bias is that for a sample of peptide fragments from an organism, it is
commonly considered that about 1-2% will bind to MHC receptors. On
average, there were 4,412 binders in a training set, and 771 binders and
77,086 random natural peptides in a validation set.

Classification performance
of our predictor was significantly higher than the performance
of the two best-known class II binding predictors (netMHCII-2.3 and
netMHCIIpan-3.2), compared on our ten validation datasets. This is due,
in part, to the large amount of new mass spectrometry data compared
to the data on which the recent netMHCII(pan) predictors were trained
(netMHCIIpan-3.2 public dataset available at http://www.cbs.dtu.dk/
suppl/immunology/NetMHCllpan-3.2/ contains 1,794 measurements
for H-2-I-Ab, all qualitative, of which 431 are binders and 1,363 are weak
or non-binders). We do not exclude the possibility that netMHCII(pan),
as a method, performs better than the HMM method. As the published
netMHCII(pan) tools lack re-training capability, we cannot compare
the methods and draw conclusions on netMHCII(pan) performance on
new qualitative data.

We determined the threshold for strong binders
by calibrating the predictor to return a percentile rank against a large
decoy set of random natural peptides. We used the approach taken by
the existing neural network-based predictors, in which strong binders
are predictions in the second percentile of the empirical distribution
of predictions on random natural peptides. The decoy set was
generated from the mouse proteome in the same way as for validation and
consists of about 1.5 × 10^6 fragments with lengths in the interval [12, 24].
Predicted neoantigens were further prioritized using the NER: the ratio
between the binding predictions for the mutant and wild-type peptides.
Expression of each mutation is represented as FPKM generated from
cDNA capture sequencing.
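
The decoy generation, percentile-rank calibration and ratio-based prioritization described above can be sketched as follows. This is an illustrative reading, not the authors' implementation: cutting each translation into consecutive fragments and discarding the trailing remainder, treating "second percentile" as a rank of at most 2%, and taking the ratio as mutant over wild type are all assumptions, and every function name is hypothetical.

```python
import bisect
import random


def cut_into_fragments(protein, rng, min_len=12, max_len=24):
    """Cut one translation into consecutive fragments of random length.

    Lengths are drawn uniformly from [min_len, max_len]; a trailing piece
    too short for the drawn length is discarded (assumption of this sketch).
    """
    fragments, pos = [], 0
    while True:
        length = rng.randint(min_len, max_len)
        if pos + length > len(protein):
            break
        fragments.append(protein[pos:pos + length])
        pos += length
    return fragments


def percentile_rank(score, sorted_decoy_scores):
    """Per cent of decoy scores that are <= score (smaller score = better binder)."""
    n_at_least_as_good = bisect.bisect_right(sorted_decoy_scores, score)
    return 100.0 * n_at_least_as_good / len(sorted_decoy_scores)


def is_strong_binder(score, sorted_decoy_scores, cutoff_percent=2.0):
    """Call a strong binder when the prediction ranks within the top 2% of decoys."""
    return percentile_rank(score, sorted_decoy_scores) <= cutoff_percent


def binding_ratio(mutant_prediction, wild_type_prediction):
    """Ratio of mutant to wild-type binding predictions (direction assumed)."""
    return mutant_prediction / wild_type_prediction


rng = random.Random(0)
toy_protein = "ACDEFGHIKLMNPQRSTVWY" * 10  # synthetic 200-residue sequence
decoy_fragments = cut_into_fragments(toy_protein, rng)
toy_decoy_scores = sorted(float(i) for i in range(1, 101))  # stand-in predictions

print(len(decoy_fragments), "decoy fragments")
print(percentile_rank(2.0, toy_decoy_scores), is_strong_binder(2.0, toy_decoy_scores))
```

Sorting the decoy scores once and ranking with a binary search keeps the calibration cheap even for a decoy set of about a million fragments.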


Peptides 

All 27-mer peptides used for neoantigen screening (Supplementary 
Table 1) were purchased from Peptide 2.0 and purified by high- 
performance liquid chromatography (HPLC) to >95% purity. The T3- 
specific mutant amino acid was placed in the centre of the peptide and 
was flanked on both sides with 13 amino acids of wild-type peptide 
sequence. 
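
The peptide layout described above (mutant residue in the centre, 13 wild-type residues on each side) can be expressed as a short helper. This is a sketch with hypothetical names; the sequence used is synthetic, and raising an error for mutations too close to a protein terminus is an assumption, since the text does not say how such cases were handled.

```python
def mutant_27mer(wild_type_protein, position, mutant_residue, flank=13):
    """Build a peptide with the mutant residue centred between wild-type flanks.

    position is 1-based, as in mutation names such as N710Y.
    """
    index = position - 1
    if index - flank < 0 or index + flank >= len(wild_type_protein):
        raise ValueError("mutation too close to a terminus for a full-length peptide")
    left = wild_type_protein[index - flank:index]
    right = wild_type_protein[index + 1:index + 1 + flank]
    return left + mutant_residue + right


# Synthetic 40-residue sequence used only to exercise the helper.
toy_protein = "ACDEFGHIKLMNPQRSTVWY" * 2
toy_peptide = mutant_27mer(toy_protein, position=20, mutant_residue="A")
print(toy_peptide, len(toy_peptide))
```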


ELISPOT 

Cells from tumours or lymph nodes were enriched for CD4* or CD8* 
T cells using the Miltenyi mouse CD4* or CD8* enrichment kits accord- 
ing to the manufacturer’s protocols. Ten thousand TIL-derived T cells 
or 50,000 tumour-draining lymph node (TDLN)-derived T cells were 
stimulated with 500,000 splenocytes isolated from naive mice pulsed
with 2 µg ml^-1 29-mer peptide (class II) or 1 µM 15-mer peptide (class I).
For analysis from spleens, 500,000 cells from whole-spleen preparations
were used. Cells were stimulated overnight in anti-mouse IFNγ-coated
ELISPOT plates (Immunospot). Plates were developed according to 
the manufacturer’s protocol and spots were quantified using a CTL 
ImmunoSpot S6 Universal machine and Professional 6.0.0 software. 


Mass spectrometry 

For isolation of I-Ab-bound peptides, 5 x 10° T3.CIITA cells were washed
twice with PBS and snap-frozen. MHC-II molecules were isolated by
immunoaffinity purification using the I-Ab-specific antibody Y-3P
(BioXCell) coupled to cyanogen bromide-activated sepharose 4B (GE
Healthcare) as described. Peptides were eluted with 0.2% trifluoroacetic acid,
cleaned by detergent removal (Pierce Detergent Removal Spin Columns,
Thermo Scientific) and desalting (Pierce C-18 Spin Columns, Thermo
Scientific), dried, and resuspended in 2% acetonitrile (ACN) and 0.1%
formic acid (20 µl). For mass spectrometry, a Dionex UltiMate 1000
system (Thermo Scientific) was coupled to an Orbitrap Fusion Lumos
(Thermo Scientific) through an Easy-Spray ion source (Thermo
Scientific). Peptide samples were loaded (15 µl/min, 3 min) onto a trap column
(100 µm × 2 cm, 5 µm Acclaim PepMap 100 C18, 50 °C), eluted (200 nl/min)
onto an Easy-Spray PepMap RSLC C18 column (2 µm, 50 cm × 75 µm ID,
50 °C, Thermo Scientific) and separated with the following gradient
(all percentages indicate buffer B: 0.1% formic acid in ACN): 0-110 min,
2-22%; 110-120 min, 22-35%; 120-130 min, 35-95%; 130-150 min,
isocratic at 95%; 150-151 min, 95-2%; 151-171 min, isocratic at 2%. Spray
voltage was 1,900 V, ion transfer tube temperature was 275 °C, and RF
lens was 30%. Mass spectrometry scans were acquired in profile mode
(375-1,500 Da at 120,000 resolution (at m/z 200)); centroided HCD MS/MS
spectra were acquired using a Top Speed method (charge states 2-7,
3 s cycle time, threshold 2 × 10^4, quadrupole isolation (0.7 Da), 30,000
resolution, collision energy 30%) with dynamic exclusion enabled
(5 ppm, 60 s). Raw data files were uploaded to PEAKS X (Bioinformatics
Solutions) for processing, de novo sequencing and database searching
against the UniProtKB/Swiss-Prot Mouse Proteome database
(downloaded 1 December 2019; 22,286 entries), appended with a truncated
sequence of mITGB1 (±20 amino acids from the site of mutation), with
mass error tolerances of 10 ppm and 0.01 Da for parent and fragment,
respectively, no enzyme specificity, and methionine oxidation as a
variable modification. False discovery rate (FDR) estimation was enabled,
and proteins were filtered for −log10P ≥ 0 and one unique peptide to
give 1% FDR at the peptide-spectrum match level. Peptides matching
to mITGB1 were manually verified by visual inspection.


Antibodies 

For immune checkpoint therapy, rat IgG2a anti-PD-1 (RMP1-14, Leinco)
and mouse IgG2b anti-CTLA4 (9D9, Leinco Technologies) antibodies
were used. Mice were injected intraperitoneally with 200 µg of each
antibody on days 3, 6 and 9 after tumour transplantation. For
multicolour flow cytometry, we used antibodies against CD45 (30-F11),
CD11B (M1/70), THY1.2 (30H12), CD4 (RM4-5), CD8B (YTS156.7.7), I-E/I-A
(M5/114.15.2), CD64 (X54-5/7.1), LY6G (1A8), T-BET (4B10), CD150/SLAM
(TC15-12F12.2), KLRG1 (2F1), ICOS (15F9), CD44 (IM7), PD-1 (29F.1A12),
SIINFEKL-H-2Kb (25-D1.16) (BioLegend), CD24 (M1/69), F4/80 (145-2342)
(BD Biosciences), FOXP3 (FJK-16s, eBiosciences) and iNOS (CXNFT,
Invitrogen). Zombie NIR (BioLegend) was used to stain for cellular viability.
The BD Cytofix/Cytoperm Plus kit (BD Biosciences) was used according
to the manufacturer’s protocol for intracellular staining of iNOS, T-BET
and FOXP3.


Tetramer staining 

Tetramer staining for mLAMA4-specific CD8+ T cells was performed as
previously described. I-Ab monomers bound to CLIP or mITGB1 were a
gift from K. Wucherpfennig. For staining, biotinylated pI-Ab monomers
were labelled at a 4:1 molar ratio with streptavidin-APC or
streptavidin-PE (Prozyme). One million cells from whole-tumour digests were
stained with equal amounts of APC and PE tetramer at 20 µg ml^-1 for
2 h at room temperature. Tetramer staining was stabilized through the
use of anti-PE and anti-APC beads (Miltenyi), similar to previously
published methods for MHC-I tetramers, followed by surface staining
for CD11B, THY1.2 and CD4.


Multi-cytokine assay 

CD4+ T cells were enriched from tumours 12 days after transplantation
using the Miltenyi mouse CD4+ enrichment kit. Ten thousand enriched
CD4+ T cells were stimulated in serum-free medium with 500,000
splenocytes isolated from naive mice pulsed with 2 µg ml^-1 peptide.
Following a 24-h incubation, secretion of IL-10, IL-1β, IL-2, IL-4, IL-5, IL-6, IL-22,
IL-9, IL-13, IL-27, IL-23, IFNγ, IL-12 p70, GM-CSF, TNF, IL-17A and IL-18 was
measured using a flow-based ProcartaPlex TH1/TH2/TH9/TH17/TH22/Treg
cytokine panel (Luminex Technologies) following the manufacturer's
protocol.


Plasmids 

Full-length mLAMA4 and mITGB1 were cloned from T3 cDNA and
full-length CIITA was cloned from 129S6 splenocytes. Gene blocks
encoding SIINFEKL and the minimal epitope of mSB2 were purchased from
Integrated DNA Technologies. All constructs were cloned into the BglII
site of pMSCV-IRES GFP (mLAMA4, CIITA, and mSB2) or pMSCV (mITGB1
and SIINFEKL) using the Gibson Assembly method (New England Bio- 
labs). To generate neoantigen-expressing KP sarcoma cell lines and 
T3.CIITA cells, constructs were transiently transfected into Phoenix 
Eco cells using Fugene (Promega). After 48 h, viral supernatants were 
subsequently used for transfection of KP sarcoma line 9025 or T3 cells. 
KP.mLAMA4, KP.mITGB1, KP.mLAMA4.mITGB1, KP.mSB2.SIINFEKL and 
T3.CIITA clones were obtained by limiting dilution. 


CD4+ T cell hybridomas and CTLL assay

Bulk CD4+ T cells from T3 tumours were isolated 12 days after
transplantation and stimulated with lethally irradiated T3.CIITA cells to establish
a rapidly dividing cell line. CD4+ T cells were fused with BW5147 cells and
cloned via limiting dilution. To assess antigen specificity and to map the
mITGB1 MHC-II binding core, splenocytes were collected from naive
mice and pulsed with 10 µg ml^-1 peptide unless otherwise stated. Fifty
thousand hybridoma cells were incubated with 100,000 peptide-pulsed 
splenocytes overnight and culture medium was collected. Production 
of IL-2 was assayed by proliferation-dependent thymidine incorpora- 
tion using the IL-2 dependent CTLL-2 cell line. Data are represented as 
counts per million (cpm). 


Measuring IFNγ production by CD8+ T cell clones

Tumour cells were treated with 100 U ml^-1 IFNγ for 48 h before use. One
hundred thousand CTL cells specific against mLAMA4 (74.17) or mSB2
(C3) were co-cultured with 50,000 tumour cells for 48 h. IFNγ in
supernatants was quantified using an IFNγ ELISA kit (eBioscience) according
to the manufacturer’s protocol.


In vivo cytotoxicity assay 

For targets, splenocytes were collected from naive mice, stained with
either 5 µM or 0.5 µM CFSE (CFSEhi and CFSElo) (Thermo Fisher Scientific)
and pulsed with either mLAMA4 (CFSEhi) or SIINFEKL (CFSElo) peptide,
respectively, at 1 µM overnight. Cells were washed extensively and
combined at a 50:50 ratio in PBS, and 20 × 10^6 cells were injected
retro-orbitally into tumour-bearing mice 11 days after tumour
transplantation. Naive, non-tumour bearing mice were used as a control. Spleens
from tumour-bearing or control naive mice were removed 24 h after
cell transfer, stained with Zombie NIR viability dye (BioLegend) and
quantified for the presence of CFSE-labelled target cells. On histograms,
equivalent heights of CFSEhi and CFSElo peaks indicate that equivalent
numbers of each cell population are present, and that no cytotoxicity
was observed. Peaks that differ in height, where the CFSElo population
is more abundant than the CFSEhi population, indicate that cytotoxicity
was observed specifically against the mLAMA4 peptide-pulsed, CFSEhi
population of cells. The equation used for calculating per cent
specific lysis was [1 − (naive control ratio/experimental ratio)] × 100 with
ratio = irrelevant percentage/specific epitope percentage.
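
The specific-lysis equation can be checked with a short calculation. The sketch below implements the formula exactly as stated; the function names are hypothetical and the percentages in the example are made up for illustration, not taken from the experiments.

```python
def target_ratio(irrelevant_percent, specific_percent):
    """Ratio = irrelevant (SIINFEKL-pulsed) % / specific (mLAMA4-pulsed) %."""
    return irrelevant_percent / specific_percent


def percent_specific_lysis(naive_irrelevant, naive_specific,
                           exp_irrelevant, exp_specific):
    """[1 - (naive control ratio / experimental ratio)] x 100."""
    naive_ratio = target_ratio(naive_irrelevant, naive_specific)
    experimental_ratio = target_ratio(exp_irrelevant, exp_specific)
    return (1.0 - naive_ratio / experimental_ratio) * 100.0


# Illustrative numbers only: equal peaks in the naive control mouse, and a
# depleted mLAMA4-pulsed peak in a hypothetical tumour-bearing mouse.
lysis = percent_specific_lysis(
    naive_irrelevant=50.0, naive_specific=50.0,
    exp_irrelevant=80.0, exp_specific=20.0,
)
print(f"{lysis:.1f}% specific lysis")
```

Normalizing to the ratio in a naive mouse corrects for any imbalance in the injected 50:50 mixture, so equal ratios in the control and experimental mice give 0% lysis.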


Statistics 

Statistical analysis was performed using GraphPad Prism software ver- 
sion 7. Unless otherwise noted, significance was determined with an 
unpaired, two-tailed Student’s t-test. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Nucleotide variant calls generated from cDNA capture sequencing of
the T3 and KP9025 sarcoma lines and used in the prediction of antigens
shown in Fig. 1a, Extended Data Fig. 3a, b and 6b are provided as
Supplementary Data 1 and Supplementary Data 2.


Code availability 


Code for the hmMHC algorithm used to predict presentation of neoantigens
by I-Ab can be accessed at https://github.com/artyomovlab/hmmhc.
Extended Data Fig. 1 | The hmMHC predictive algorithm and IEDB'18 H2-I-Ab
training dataset composition. a, An example of a fully connected HMM with
three hidden states, and emissions corresponding to amino acids.
b–d, Composition of IEDB dataset (MHC full ligand export downloaded on 25
November 2018) represented as number of peptides per binding category and
measurement type (b, c) and binding category and peptide length (d). Strong
binders: IC50 < 50 nM; binders: 50 nM < IC50 < 500 nM; weak binders:
500 nM < IC50 < 5,000 nM; non-binders: all remaining peptides.
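
The binding categories in this legend amount to a simple threshold rule on IC50. The sketch below is illustrative only; the function name is hypothetical, and because the legend uses strict inequalities, the assignment of exact boundary values (50, 500 and 5,000 nM) to the weaker category is an assumption.

```python
def binding_category(ic50_nm: float) -> str:
    """Assign a peptide to a binding category from its measured IC50 (nM)."""
    if ic50_nm < 0:
        raise ValueError("IC50 cannot be negative")
    if ic50_nm < 50:
        return "strong binder"
    if ic50_nm < 500:       # assumption: exactly 50 nM counts as 'binder'
        return "binder"
    if ic50_nm < 5000:      # assumption: exactly 500 nM counts as 'weak binder'
        return "weak binder"
    return "non-binder"     # all remaining peptides


for value in (10, 100, 1000, 10000):
    print(value, binding_category(value))
```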
Extended Data Fig. 2 | Performance of hmMHC compared to netMHCII-2.3
and netMHCIIpan-3.2. a, hmMHC (orange shapes) underwent 10× cross-validation.
In each of the ten cross-validation partitions, on average there were
4,412 binders in the training set, and 771 binders and 77,086 random natural
peptides in the validation set. Performance was compared in terms of AUROC to
the performance of netMHCII-2.3 (blue triangles) and netMHCIIpan-3.2 (purple
triangles) applied on the same validation sets. For hmMHC, performance for
different numbers of hidden states is shown. For netMHCII-2.3 and
netMHCIIpan-3.2, performance is shown for both predicted affinity and
percentile rank (PR). b, Receiver operating characteristic (ROC) curves showing
the performance of hmMHC on the H2-I-Ab dataset compared to existing
predictors. ROC curves of all peptides and per specific peptide length for every
cross-validation partition are shown. c, Illustration of percentile rank for strong
binder classification calibrated on random natural peptides. Red lines indicate
the percentile ranks of peptides screened for CD4+ T cell reactivity.
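
A percentile rank calibrated on random natural peptides can be sketched as follows. This is a minimal illustration, not the hmMHC implementation; the function name and the toy background are assumptions, and it assumes that smaller predictor values indicate stronger predicted binding.

```python
from bisect import bisect_right
from typing import Iterable


def percentile_rank(score: float, background_scores: Iterable[float]) -> float:
    """Percentage of background (random natural peptide) scores that are
    less than or equal to `score`; smaller scores are assumed to be better."""
    ordered = sorted(background_scores)
    if not ordered:
        raise ValueError("background must contain at least one score")
    return 100.0 * bisect_right(ordered, score) / len(ordered)


# Toy background of 100 hypothetical predictor values: 1, 2, ..., 100.
background = range(1, 101)
rank = percentile_rank(2, background)
print(f"percentile rank: {rank:.1f}")
```

A peptide can then be classified by comparing its percentile rank with a chosen cut-off, so that thresholds are comparable between predictors whose raw scores are on different scales.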
Extended Data Fig. 3 | mITGB1 is a major MHC-II-restricted neoantigen in T3
sarcomas. a, b, T3 MHC-II neoantigen predictions for all expressed mutations
were made using hmMHC (a) and netMHCII-2.3 (b) (netMHCIIpan-3.2
predictions yielded very similar results, data not shown). The predictions are
shown as −log10odds predictor value or logIC50 (smaller values indicate higher
likelihood of being presented by I-Ab) and expression level (FPKM). Strong
binders are defined as mutations residing in the second percentile of
I-Ab binding predictions for random natural peptides for each algorithm
(−log10odds < 26.21 or logIC50 < 343.8 nM). The N710Y mutation in ITGB1 met the
strong binder threshold in the hmMHC predictions but not in the netMHCII-2.3
predictions. Red dots indicate all mutations that were screened for CD4+ T cell
reactivity. Green line denotes high-expression cut-off (FPKM = 89.1). Blue line
indicates strong binder cut-off for each algorithm. c, Two million T3 sarcoma
cells were injected subcutaneously into syngeneic mice and CD4+ TILs were
isolated on day 12. IFNγ ELISPOT was performed using naive splenocytes pulsed
with 2 μg ml⁻¹ of the indicated peptides. Data are shown as mean of three
independent experiments ± s.e.m. d, Gating strategy for pI-Ab tetramer staining
of whole TILs. e, Quantification of mITGB1-tetramer and CLIP-tetramer staining
of CD4+ T cells from whole T3 TILs 12 days after transplantation. Data are shown
as mean ± s.e.m. per cent tetramer-positive cells of CD4+ cells from three
independent experiments. f, Syngeneic 129S6 mice were injected
subcutaneously with 2 × 10⁶ T3 sarcoma cells and TIL-derived CD4+ T cells were
collected 12 days after transplantation. CD4+ T cells were stimulated with naive
splenocytes pulsed with 2 μg ml⁻¹ OVA(323–339) control or mITGB1(707–724) peptide
for a flow-based multi-cytokine array. Representative data from one of two
independent experiments using pools of five tumours each are shown as
average of technical triplicate wells from three pooled tumours.
Extended Data Fig. 4 | T3 TIL-derived CD4+ T cell hybridomas are reactive
against mITGB1. CTLL assay of T3 TIL-derived CD4+ T cell hybridoma lines
[…] indicated peptides. Representative data from one of three independent
experiments are shown as average cpm from technical duplicate wells.
Extended Data Fig. 5 | The mITGB1 epitope is presented on I-Ab. a, T3 CD4+
T cell hybridomas were stimulated with 2 μg ml⁻¹ mITGB1(710Y) or wild-type
ITGB1(710N) peptide-pulsed splenocytes. Activation was measured by CTLL
assay. Representative data from three independent hybridoma lines are shown
as average of technical replicate wells. b, Mapping of the mITGB1 MHC-II binding
core was performed using the CD4+ T cell hybridoma line 41 stimulated with
naive splenocytes pulsed with 2 μg ml⁻¹ of overlapping peptides covering
mITGB1(707–724). Red denotes the T3-specific mutant amino acid at position p1 of
the minimal epitope; underlining denotes the validated binding core. Green
amino acids represent random residue substitutions used to specifically define
valines at residues 715 and 718 as the p6 and p9 MHC-II binding positions and the
complete MHC-II binding core. Representative data from two independent
experiments are shown as the average of technical triplicate wells. c, MHC-II I-Ab
staining of parental T3 cells, IFNγ-stimulated T3 cells and T3 cells transduced
with a vector encoding CIITA (T3.CIITA). Representative data from one of three
independent experiments are shown. d, Mirror plot showing match between
MS/MS spectra of the 14-mer peptide sequence encompassing the N710Y site of
mITGB1 eluted from T3.CIITA cells (positive axis) and a corresponding synthetic
peptide (negative axis). Labelled m/z values reflect those experimentally
observed for the endogenous peptide, with peaks representing b ions
highlighted in blue and y ions in red.
Extended Data Fig. 6 | mITGB1 CD4+ T cells are required for tumour rejection
in response to ICT. a, Comparison of total number of expressed missense
mutations between ten different MCA-induced sarcomas and KP9025 cells.
Mutations were defined by whole-exome sequencing and RNA sequencing, and
mutational load is shown on a per cell basis. b, Comparison of predicted
neoantigen MHC-I affinity values between KP9025 and MCA-induced sarcoma
F244 for H-2Db (top) and H-2Kb (bottom). KP9025 cells were not predicted to
express any MHC-I neoantigens. c, Rag2−/− mice were subcutaneously injected
with 1 × 10⁶ KP.mLAMA4, KP.mITGB1, KP.mLAMA4.mITGB1 or KP.mSB2.SIINFEKL
cells. Representative data from one of two independent experiments are
presented as tumour diameters from individual mice (n = 5 mice per group for
KP.mLAMA4, KP.mITGB1 and KP.mLAMA4.mITGB1 and n = 3 mice for the
KP.mSB2.SIINFEKL group per experiment). d, Wild-type syngeneic 129S4 mice were
injected subcutaneously with 1 × 10⁶ KP.mLAMA4, KP.mITGB1 or
KP.mLAMA4.mITGB1 cells and treated with anti-PD-1 (top) or anti-CTLA4 single-agent ICT
(bottom) on days 3, 6 and 9 after transplantation. Representative data from one
of three independent experiments are shown as tumour diameters from
individual mice (n = 5 in all groups per experiment). e, Survival curves for
experiments in d and Fig. 2e (n = 15 in all groups).


[Extended Data Fig. 7 graphic. Legible labels: WT, αPD-1, αCTLA-4, αPD-1 + αCTLA4; KP, KP.mLama4; Parental, Outgrown clone, Parental + IFNγ, Outgrown clone + IFNγ; SIINFEKL-H-2-Kb-PE; unstimulated, IFNγ stimulated; KP.mSB2.SIINFEKL.mITGB1; days post transplant.]


Extended Data Fig. 7 | Outgrowth of nonimmunogenic sarcoma cells
expressing MHC-I neoantigens is not a result of cancer immunoediting.
a, Rag2−/− or wild-type 129S4 mice were injected with 1 × 10⁶ KP9025 or
KP.mLAMA4 cells and treated with anti-PD-1, anti-CTLA4 or anti-PD-1 + anti-CTLA4
on days 3, 6 and 9 after injection. Tumours were removed once they reached a
maximum diameter of 20 mm in any direction and sarcoma cell lines were
established ex vivo. Cell lines were stimulated with IFNγ to upregulate MHC-I and
subsequently used to stimulate the mLAMA4-specific CD8+ 74.14 T cell clone.
Secretion of IFNγ by T cells was measured using enzyme-linked immunosorbent
assay (ELISA). Representative data from two independent experiments are
represented as the average of two independent tumour samples in each group.
b, Wild-type 129S4 mice were injected with 1 × 10⁶ KP.mSB2.SIINFEKL cells and
treated with anti-PD-1 + anti-CTLA4 combination ICT on days 3, 6 and 9 after
injection. Tumours were removed as in a. Established ex vivo cell lines were
cloned by limiting dilution and parental KP.mSB2.SIINFEKL cells or individual
clones from outgrown tumours were used to stimulate the mSB2-specific C3
CD8+ T cell clone; production of IFNγ was quantified using ELISA. Representative
data from four independent experiments are presented as average IFNγ
concentration of eight individual clones ± s.e.m. Significance was determined
using an unpaired, two-tailed t-test. c, Cell surface staining of SIINFEKL-H-2-Kb
expressed by unstimulated or IFNγ-stimulated parental KP.mSB2.SIINFEKL cells
or individual clones described in b. A representative histogram is shown.
d, Quantification of mean ± s.e.m. SIINFEKL-H-2-Kb mean fluorescence intensity
(MFI) from eight individual clones in c. NS, not significant. e, Survival curves of
wild-type 129S4 mice injected subcutaneously with 1 × 10⁶
KP.mSB2.SIINFEKL.mITGB1 cells. Mice were treated with control monoclonal antibodies or anti-PD-1
+ anti-CTLA4 combination ICT on days 3, 6 and 9 after injection. n = 10 mice per
group from two independent experiments. ****P=1.5 x 10°, Mantel-Cox test.
Extended Data Fig. 8 | mITGB1-specific CD4+ T cells display an activated TH1
phenotype. a, Whole TILs from KP.mLAMA4.mITGB1 tumours 12 days after
transplantation were stained with mITGB1–I-Ab tetramers. Populations were
previously gated on viable CD11b−CD4+ cells. Representative data from one of
two independent experiments with five pooled tumours each are shown.
b, mITGB1–I-Ab tetramer-negative and tetramer-positive cells described in a were
analysed for expression of T-BET and FOXP3. Representative plots are shown.
c, Quantification of two independent experiments in b as average per cent of
tetramer-negative and tetramer-positive cells staining positive for the indicated
protein. Tumour-bearing mice were treated with control monoclonal antibodies
or anti-CTLA4 on days 3, 6 and 9 after transplantation where indicated.
d, mITGB1–I-Ab tetramer-positive and tetramer-negative cells in a were analysed
for expression of PD-1. Representative plots are shown. e, Quantification of two
independent experiments described in d shown as average per cent of
tetramer-negative and tetramer-positive cells staining positive for PD-1. f, mITGB1–I-Ab
tetramer-positive cells in a were analysed for expression of the indicated
proteins. Representative histograms from one of two independent experiments
using pools of five tumours each are shown.
Extended Data Fig. 9 | CD4+ T cell help is required at the tumour site during
primary and memory responses. a, Rag2−/− mice were simultaneously injected
with 1 × 10⁶ KP.mLAMA4 and KP.mLAMA4.mITGB1 cells on opposite flanks.
Representative data from one of two independent experiments are shown as
individual tumour diameter (n = 3 in each experiment). b, Wild-type 129S4 mice
were injected with 1 × 10⁶ KP.mITGB1 cells and were treated with anti-PD-1 +
anti-CTLA4 combination ICT on days 3, 6 and 9 after injection. Representative data
from one of two individual experiments are shown as individual tumour
diameters (n = 5 in all experiments). c, Wild-type 129S4 mice were
simultaneously injected with 1 × 10⁶ KP.mLAMA4 and KP.mLAMA4.mITGB1 cells
on opposite flanks and treated as in b. Representative data from one of two
individual experiments are shown as individual tumour diameters (n = 5 in all
experiments). d, Wild-type 129S6 mice were injected subcutaneously with 2 × 10⁶
T3 sarcoma cells and were treated with anti-PD-1 + anti-CTLA4 combination ICT
on days 3, 6 and 9 after injection. Following tumour rejection and a 30-day
recovery period, tumour-experienced mice were rechallenged with 2 × 10⁶ T3
cells in the presence of control monoclonal antibody or CD4-depleting
antibody, or with irrelevant sarcoma cells. Representative data from one of two
independent experiments are shown as average tumour diameter ± s.e.m. (n = 5
in all groups per experiment). e, Wild-type 129S4 mice were injected
subcutaneously with 1 × 10⁶ KP.mLAMA4.mITGB1 cells followed by surgical
resection 10 days after transplantation. After a 30-day recovery period,
tumour-experienced mice were rechallenged with 1 × 10⁶ KP9025, KP.mLAMA4.mITGB1
or KP.mLAMA4 cells. Representative data from one of two independent
experiments are shown as average tumour diameter ± s.e.m. (n = 5 in all groups
per experiment). ****P=2 x 10 by two-way ANOVA with multiple comparisons
and Bonferroni correction. f, Quantification of data from three independent
experiments in Fig. 5c is shown as average number of spots ± s.e.m. (left) and
average number of mITGB1-specific CD4+ cells ± s.e.m. (right). **P = 0.003,
****P= 7,2 x 10> (unpaired, two-tailed t-test). g, CD45+Ly6G−MHCII+CD64+CD25
CD11b+F4/80+ macrophages in TILs from mice bearing the indicated
contralateral tumours were analysed for expression of iNOS 11 days after tumour
transplant. Representative data from four independent experiments are shown.
h, Quantification of iNOS+ macrophages from experiments in f as a per cent of
total CD45+ cells. Data are shown as average ± s.e.m. of four independent
experiments. *P = 0.03 by unpaired, two-tailed t-test. i, CD45+Ly6G−MHCII+CD64+
CD25 CD11b+F4/80+ macrophages from the indicated contralateral tumours
described were isolated 11 days after transplantation and analysed for
expression of iNOS. Representative plots from two independent experiments
are shown. j, Quantification of iNOS+ macrophages from two independent
experiments in h is shown as average per cent of total CD45+ cells.
Extended Data Fig. 10 | Gating strategies for multi-colour flow cytometry.
Gating strategies for multi-colour flow cytometry analysis of tumour-infiltrating
macrophage (a) and T cell (b) populations.
Statistics


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals)


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable.


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection: Flow cytometry data were collected using BD FACSDiva software version 8.0. ELISPOT data were collected using a CTL ImmunoSpot S6
Universal machine and Professional 6.0.0 software. The code for the hmMHC algorithm can be accessed at
https://github.com/artyomovlab/hmmhc.


Data analysis: Flow cytometry data were analyzed using FlowJo software version 10.2. Mass spectrometry data were analyzed using PEAKS X. All statistical
analysis was performed using GraphPad Prism software version 7.0c.


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 


All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability


Nucleotide variant calls generated from cDNA capture sequencing of the T3 and KP9025 sarcoma lines and used in the prediction of antigens shown in Figure 1a, 
Extended Data Figure 3a-b, and Extended Data Figure 6b are available within the article as Supplementary Data 1 and Supplementary Data 2. 
Field-specific reporting


Life sciences study design


All studies must disclose on these points even when the disclosure is negative. 


Sample size: Sample sizes were determined based on 25 years' experience with these types of tumors in general, and 6 years with these specific cell lines.


Data exclusions: No data were excluded from analysis.


Replication: All experiments used in this study were replicated in at least 2 independent experiments; the majority of experiments were performed at least
3 times. All attempts at replication were successful.


Randomization: In each experiment, all mice were obtained from the same cohort, and injected with the same preparation of tumor cells. Cages of animals
were then randomly assigned into treatment groups.


Blinding: In these experiments, one person injected tumors, but longitudinal measurements and antibody treatments were performed by three
independent people.


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems (n/a | Involved in the study)
- Antibodies
- Eukaryotic cell lines
- Palaeontology
- Animals and other organisms
- Human research participants
- Clinical data


Methods (n/a | Involved in the study)
- ChIP-seq
- Flow cytometry
- MRI-based neuroimaging


Antibodies 
Antibodies used: Antibodies used for multi-color flow cytometry were CD45 (30-F11), CD11b (M1/70), Thy1.2 (30H12), CD4 (RM4-5), CD8B
(YTS156.7.7), I-E/I-A (M5/114.15.2), CD64 (X54-5/7.1), Ly6G (1A8), T-bet (4B10), CD150/SLAM (TC15-12F12.2), KLRG1 (2F1), ICOS
(15F9), CD44 (IM7), PD-1 (29F.1A12), SIINFEKL-H-2-Kb (25-D1.16) (Biolegend), CD24 (M1/69), F4/80 (T45-2342) (BD Biosciences),
Foxp3 (FJK-16s, eBiosciences) and iNOS (CXNFT, Invitrogen).


Validation: All antibodies used for flow cytometry are commercially available and validation materials are available on the appropriate
websites. Tetramers were validated by staining cell populations from irrelevant tumors not expressing the indicated antigens.


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s): The T3 sarcoma cell line was generated via subcutaneous injection of methylcholanthrene into 129S6 Rag2-deficient mice.
Resulting tumors were used to establish a parental sarcoma cell line, which was further cloned by limiting dilution to produce
T3 (Matsushita et al., Nature 2012; Gubin et al., Nature 2014). The KP9025 cell line was generated via intramuscular injection
of lentiviral cre-recombinase into the flanks of 129S4 strain mice expressing the KRas G12D oncogene and a floxed p53
cassette. A resulting tumor was used to establish the KP9025 cell line.


Authentication: Cell lines were authenticated by whole exome sequencing and RNA-seq, and further validated by expression of particular
antigens.


Mycoplasma contamination: All cell lines tested negative for mycoplasma contamination.


Commonly misidentified lines (see ICLAC register): No commonly misidentified cell lines were used.


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals: Experiments utilized both Rag2-deficient and wild-type male Mus musculus of strains 129S6 and 129S4 between 8 and 10 weeks of
age. It was necessary to match the sex and strain of mice to those in which each cell line was generated.


Wild animals: This study did not include wild animals.


Field-collected samples: This study did not include samples collected from the field.


Ethics oversight: All animal studies were performed with the approval of the Association for the Accreditation of Laboratory Animal
Care-accredited Animal Studies Committee of Washington University in St. Louis.
Note that full information on the approval of the study protocol must also be provided in the manuscript.


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 

Sample preparation: For flow cytometry analysis of spleens, tissue was harvested and dissociated into a single-cell suspension using syringe plungers
and 40 micron cell filters. Red blood cells were lysed using ACK lysis buffer prior to staining. For flow cytometry of
tumor-infiltrating cells, tumors were harvested, manually dissociated into small pieces and digested with collagenase 1A. When
appropriate, red blood cells were lysed prior to staining. For intracellular stains, cells were fixed and permeabilized using the BD
Cytofix/Cytoperm Plus kit.

Instrument: Flow cytometry data were collected on either a BD FACSCanto II or BD LSRFortessa X-20.

Software: Flow cytometry data were collected using BD FACSDiva software version 8.0. Data were analyzed using FlowJo software version 10.2.


Cell population abundance: No cell sorting was used.


Gating strategy: For MHC class I and class II tetramer staining, positive and negative population gates were defined based on staining with
tumor-irrelevant tetramers. For T-bet, Foxp3, PD-1, CD44, ICOS, CD150/SLAM, KLRG1 and iNOS staining, positive and negative
populations were defined based on staining with an IgG antibody.


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
Gene duplication is a common and powerful mechanism by which cells create new
signalling pathways, but recently duplicated proteins typically must become
insulated from each other and from other paralogues to prevent unwanted crosstalk.
A similar challenge arises when new sensors or synthetic signalling pathways are
engineered within cells or transferred between genomes. How easily new pathways can
be introduced into cells depends on the density and distribution of paralogous
pathways in the sequence space that is defined by their specificity-determining
residues. Here we directly investigate how crowded this sequence space is, by
generating novel two-component signalling proteins in Escherichia coli using cell
sorting coupled to deep sequencing to analyse large libraries designed on the basis of
coevolutionary patterns. We produce 58 insulated pathways comprising functional
kinase–substrate pairs that have different specificities than their parent proteins, and
demonstrate that several of these new pairs are orthogonal to all 27 paralogous
pathways in E. coli. Additionally, from the kinase–substrate pairs generated, we identify
sets consisting of six pairs that are mutually orthogonal to each other, which
considerably increases the two-component signalling capacity of E. coli. These results
indicate that sequence space is not densely occupied. The relative sparsity of
paralogues in sequence space suggests that new insulated pathways can arise easily
during evolution, or be designed de novo. We demonstrate the latter by engineering a
signalling pathway in E. coli that responds to a plant cytokinin, without crosstalk to
extant pathways. Our work also demonstrates how coevolution-guided mutagenesis
and the mapping of sequence space can be used to design large sets of orthogonal
protein–protein interactions.


Many promising therapies, such as chimeric antigen receptor T cells and
engineered probiotic microorganisms, require the ability to repurpose
and transfer signalling pathways into new genomic contexts. Similarly,
new pathways arise during evolution through the duplication and
diversification of existing signalling mechanisms. For engineered and
evolved signalling proteins to execute independent functions within
cells, they must minimize detrimental crosstalk, which is a substantial
challenge for proteins with paralogues.

For signalling proteins such as protein kinases and their substrates,
specificity is enforced primarily at the amino acid level. These
specificity-determining residues define a finite sequence space. How cells
globally organize paralogous protein families in sequence space, and
whether the specificity-determining residues of individual members
of these families have been optimally distributed to minimize crosstalk
during evolution, remain open questions. Work with SH3 and PDZ
domains in eukaryotes has suggested that paralogues are densely packed
in sequence space, a result of intense negative selection against crosstalk
leading to a global optimization of specificity. However, sequence
space is vast and nature may not have fully occupied or explored it.

To assess how crowded paralogues are in sequence space, we sought
to engineer protein complexes that are functional but insulated from
extant paralogues. If sequence space is densely occupied by existing
paralogues, it should be difficult to introduce new insulated pathways
(Fig. 1a, Extended Data Fig. 1a). However, if sequence space is sparsely
occupied, new pathways should be easy to introduce, and have a low
probability of crosstalk. The design of orthogonal interacting proteins
remains a major challenge, and previous efforts have generated only
3–4 orthogonal pairs.

We focused on bacterial two-component signalling proteins, which
involve a sensor histidine kinase that, upon activation, autophosphorylates
and then transfers its phosphoryl group to a cognate response
regulator to effect changes in cellular behaviour (Fig. 1b, Extended
Data Fig. 1b, c). Most histidine kinases are bifunctional, and act as
phosphatases in the absence of a signal to dephosphorylate their cognate

1Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA. 2Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.
3Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA. *e-mail: laub@mit.edu
Fig. 1 | Probing the density of paralogues in sequence space by building new
orthogonal pairs of signalling proteins. a, Two models for the distribution of
paralogues in sequence space. Each oval is a niche that represents the set of
substrates with which a kinase can interact, given its specificity-determining
residues. These niches could be densely packed in sequence space (top) or more
sparsely distributed (bottom), which would make the introduction of new
insulated histidine kinases (denoted as HK1* to HK6*) relatively difficult or easy,
respectively. b, Depending on Mg²⁺ levels, E. coli PhoQ (blue) can phosphorylate
or dephosphorylate PhoP (green) to stimulate or repress, respectively, gene
expression, including that of a YFP reporter (bottom). c, A library of about
5 × 10⁸ PhoQ–PhoP variants was first examined by flow cytometry before growth
overnight in low levels of Mg²⁺, followed by outgrowth with high or low levels of
Mg²⁺, and FACS, with deep sequencing of 8 consecutive bins. For each pair of
variants, the number of reads in each bin was plotted to infer its fold-induction
value. d, Pearson correlation between YFP values inferred by sort-seq, and
measured individually. Points indicate mean ± s.d., n = 2 biological replicates.
AU, arbitrary units. e, For each pair of variants indicated (including the wild-type
pair (top)), the signalling profile inferred by sort-seq (left) is compared to that
measured in isolation. Experiment was repeated three times with similar results.
RR, response regulator. f, Plot of mean YFP level inferred by sort-seq, for each
pair of variants in the 'off' and 'on' states.


regulator. Bacteria usually encode dozens of two-component pathways
(Fig. 1a, Extended Data Fig. 1d) that are mutually insulated; the vast
majority of kinase–regulator pairs form exclusive, one-to-one
relationships. Both kinase and phosphatase activities, which involve the same
protein–protein interface, contribute to pathway specificity.

The specificity of interaction between kinases and regulators is driven
by a limited set of interfacial residues in each protein that strongly
coevolve with one another (Extended Data Fig. 1c). To identify
combinations of these residues that are functional and insulated from existing
pathways in E. coli, we constructed a dual library of mutants in which
the 11 key coevolving interface residues of a canonical two-component
system (PhoQ–PhoP) were randomized (Fig. 1c, Extended Data
Fig. 1c–e).

To identify functional combinations of residues, we first grew the
library of PhoQ–PhoP variants overnight in medium with low Mg²⁺,
which activates PhoQ. Because cells must phosphorylate PhoP to grow
when Mg²⁺ is limiting, this step enriches for functional PhoQ–PhoP
variants (Fig. 1c). Variants that survived these limiting Mg²⁺ conditions
were then subjected to fluorescence-activated cell sorting (FACS) and
deep sequencing to quantify their signal responsiveness (sort-seq)
(Extended Data Figs. 2, 3). To gauge PhoP activity in vivo, we used a
fluorescent reporter, PmgrB-yfp (Supplementary Tables 1, 2). In conditions of
low extracellular Mg²⁺, functional PhoQ promotes PhoP phosphorylation
and production of YFP, whereas with high levels of Mg²⁺ PhoQ drives
PhoP dephosphorylation to limit the accumulation of YFP (Fig. 1b).

To identify signal-responsive variants, we sorted cells from each con-
dition into eight bins and deep-sequenced the randomized regions of
variants in each bin (Fig. 1c). We calculated the frequency of each vari-
ant pair in each bin to yield distributions of individual variant pairs under
conditions of low and high Mg²⁺. We then assessed the mean levels of
YFP in each condition, as well as the fold induction (or signal responsive-
ness) of each variant pair.
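The calculation described above can be sketched in a few lines of Python. This is a minimal illustration rather than the pipeline used in the study: it assumes that the frequency of a variant in a bin is its share of the reads from that bin, weighted by the fraction of sorted cells that fell into the bin, and it estimates the mean by a weighted average over hypothetical bin centres (in log10 YFP units) instead of fitting a curve. All function names and bin centres are illustrative.

```python
def bin_frequencies(variant_reads, total_reads, cell_fractions):
    """Distribution of one variant across sort bins.

    variant_reads[i]  : reads of this variant in bin i
    total_reads[i]    : all reads obtained from bin i
    cell_fractions[i] : fraction of sorted cells that fell into bin i
    (weighting by cell fraction is an assumed reading of the normalization)
    """
    weighted = [
        (v / t) * c if t else 0.0
        for v, t, c in zip(variant_reads, total_reads, cell_fractions)
    ]
    total = sum(weighted)
    return [w / total for w in weighted] if total else weighted


def mean_log_yfp(freqs, bin_centres_log10):
    """Frequency-weighted mean of log10(YFP) across bins."""
    return sum(f * x for f, x in zip(freqs, bin_centres_log10))


def fold_induction(mean_on_log10, mean_off_log10):
    """Ratio of on-state to off-state YFP, from means held in log10 units."""
    return 10 ** (mean_on_log10 - mean_off_log10)


# Hypothetical example with eight bins centred at log10(YFP) = 1.0 ... 4.5
centres = [1.0 + 0.5 * i for i in range(8)]
totals = [1000] * 8
cells = [0.125] * 8
on = bin_frequencies([0, 0, 0, 0, 0, 0, 40, 0], totals, cells)
off = bin_frequencies([0, 0, 40, 0, 0, 0, 0, 0], totals, cells)
fi = fold_induction(mean_log_yfp(on, centres), mean_log_yfp(off, centres))
```

A variant whose reads shift from a low bin in high Mg²⁺ to a high bin in low Mg²⁺ yields a large fold induction, whereas a constitutively ‘on’ variant yields a value close to 1.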

To validate our selection scheme, we isolated at random 48 individual
clones from our starvation-enriched library and measured the distribu-
tion of YFP levels under conditions of low and high Mg²⁺ (Fig. 1d). Of
these 48 clones, 32 had sufficient sequencing coverage for quantifi-
cation by sort-seq. Each individual flow-cytometry profile showed a
strong similarity to the distribution inferred from sort-seq, including
the profiles of variants with high fold-induction values and those that
were constitutively ‘on’ (Fig. 1e). The constitutively ‘on’ behaviour prob-
ably arises when a PhoQ variant lacks phosphatase activity, as PhoP can
then accumulate phosphoryl groups from other sources (for example,
acetyl phosphate); this behaviour is seen with a ΔphoQ strain and in
variants that contain a stop codon in phoQ (Fig. 1f).

To select signal-responsive variants (similar to wild-type PhoQ-PhoP), 
we identified combinations of residues that produced fold-induction 
values of more than 20; there were 502 of these sequences, which we 
hereafter refer to as functional PhoQ*-PhoP* variants (these are num-
bered sequentially as PhoQ₁*-PhoP₁*, PhoQ₂*-PhoP₂* and so on) (Fig. 1f,
Extended Data Fig. 2i). Most combinations of residues that were identi- 
fied as functional shared few identities with the residues of wild-type 
PhoQ-PhoP. We isolated and characterized 41 diverse PhoQ*-PhoP* 
variants that showed more than 20-fold induction, and shared fewer 
than 5 residue identities with the wild-type PhoQ-PhoP at the 11 ran- 
domized positions (Fig. 2a). To test whether these PhoQ*-PhoP* pairs 
were insulated from the wild-type proteins, we built strains in which 
each of the PhoQ* and PhoP* variants was tested for interaction with 
a wild-type partner (Supplementary Table 3). For 16 of the 41 pairs we 
tested, there was substantially higher Mg²⁺-dependent signalling with
the mutant-protein pair than for either mutant paired with a wild-type 
partner (Fig. 2a). Thus, these 16 pairs do not crosstalk with the wild- 
type PhoQ-PhoP proteins, which indicates that our selection scheme 
produced functional signalling pathways that are insulated from the 
parental proteins. 

To examine the mechanistic basis of insulation, we purified the PhoP* 
variants and the cytoplasmic domains of the PhoQ* variants for 7 of the 
16 validated insulated protein pairs. The autokinase, phosphotransfer, 
and phosphatase activities of a histidine kinase can be assessed using a 
single assay (Fig. 2b). Each kinase is autophosphorylated with [γ-³²P]
ATP and then mixed with a given partner, which results in phosphotrans-
fer and a phosphorylated response regulator; as the unphosphorylated
kinase accumulates it stimulates the dephosphorylation of a response 
regulator, which leads to the depletion of the radiolabelled response 
regulator. 

For three of the functional mutant pairs, we observed the same 
pattern of activities as were observed with wild-type proteins, which 
Fig. 2 | Orthogonality of selected pairs of PhoQ-PhoP variants. a, Fold-
induction values measured by flow cytometry for PhoQ*-PhoP* pairs (right),
compared to PhoQ* with wild-type PhoP (left) and wild-type PhoQ with PhoP*
(middle). The interface residues of each pair are listed, along with the number of
residues that share identity with those of the wild-type proteins. b, Time course
of autophosphorylated PhoQ or PhoQ,,* incubated with PhoP or PhoP,;*. For
PhoQ*-PhoP* and PhoQ-PhoP, phosphotransfer leads initially to
phosphorylated PhoP and a concomitant decrease in phosphorylated PhoQ.


demonstrates that these PhoQ*-PhoP* pathways contain functional 
phosphotransfer and phosphatase activities (Fig. 2b, Extended Data 
Fig. 4a). We then tested each mutant protein with a wild-type partner. 
We found that the wild-type PhoQ could phosphotransfer to some PhoP* 
variants but could not efficiently dephosphorylate them. Similarly, we 
found that the PhoQ* variants could phosphorylate wild-type PhoP but 
could not dephosphorylate it. Thus, the orthogonality of these variant 
pairs with respect to the wild-type system is driven largely by highly spe- 
cific phosphatase activity. 

For four of the functional insulated mutant pairs that we purified, the 
kinases exhibited no detectable autokinase activity in vitro. However, 
they retained phosphatase activity and each was specific for the selected 
cognate PhoP* partner relative to the wild-type PhoP (Fig. 2c, Extended 
Data Fig. 4b). Thus, the phosphatase activity of a PhoQ* variant may be 
sufficient to support a functional and insulated PhoQ*-PhoP* system; 
phosphoryl donors such as acetyl phosphate (or other kinases) would
drive PhoP* phosphorylation under conditions of low Mg²⁺, and the
phosphatase activity of PhoQ* would ensure that PhoP* is not active
under conditions of high Mg²⁺.

Next, we assessed the orthogonality of functional variants with respect 
to the other two-component signalling pathways in E. coli. We purified 
27 response regulators from E. coli and assayed, in parallel, the ability
of PhoQ,,* and PhoQ,,* to phosphorylate each regulator as well as their 
corresponding partner (PhoP,,* and PhoP,,*, respectively) (Fig. 2d). No 
phosphotransfer was detected from either kinase to any of the endoge-
nous E. coli response regulators after 5 min, except to the wild-type PhoP.
However, as shown in Fig. 2b, the PhoQ,,*—-PhoP,,* and PhoQ,,*—PhoP,;* 
PhoQ eventually becomes unphosphorylated and drives dephosphorylation of
PhoP, which yields a blank lane. c, PhoQ* variants with no detectable autokinase 
activity specifically dephosphorylate their cognate PhoP* variants. 
Phosphorylated PhoP* variants were incubated with their cognate PhoQ* or 
wild-type PhoQ for the times indicated. d, Phosphotransfer profiles of wild-type 
PhoQ (top) and three PhoQ* variants. In each case, autophosphorylated kinase 
was incubated for 5 min with each of 27 E. coli response regulators, and its
selected PhoP* variant. All gels are representative of two or more replicates. 


pairs are insulated from the wild-type PhoQ-PhoP by the highly spe- 
cific phosphatase activity of each kinase. Some phosphotransfer to 
other regulators was detected after longer incubations, at which point 
many histidine kinases exhibit promiscuity (reflecting their homology)
(Extended Data Fig. 4c). We also examined phosphotransfer from 5 E. coli
kinases to 11 PhoP* variants (Extended Data Fig. 5); in each case, the native 
kinase preferentially phosphorylated its cognate response regulator. 
PhoP* variants were typically more insulated than wild-type PhoP from 
these noncognate kinases. Finally, we phosphorylated 12 E. coli response
regulators and then examined their dephosphorylation by three PhoQ* 
variants (Extended Data Fig. 6); in each case, PhoQ* robustly dephos- 
phorylated its cognate PhoP*, but not the native response regulators. 
Taken together, these results indicate that the functional PhoQ*-PhoP* 
variants that we identified are insulated from native E. coli pathways.

To further test the global insulation of selected PhoQ*-PhoP* variants,
we used RNA sequencing (RNA-seq) to examine gene expression when
strains that carry one of six pairs of PhoQ*-PhoP* variants were grown
under conditions of excess or limiting Mg²⁺ to repress or stimulate PhoQ,
respectively. Wild-type and PhoQ*-PhoP* systems produced a similar 
induction of known PhoP-dependent genes (Extended Data Fig. 7a). To 
assess whether a variant PhoQ* crossphosphorylated other response 
regulators, we took advantage of the fact that active response regula- 
tors typically autoregulate, and promote expression of themselves and 
their cognate histidine kinase. Notably, none of the six strains that we
tested showed significant induction of other two-component systems 
relative to a wild-type control (Extended Data Fig. 7b-d). These RNA- 
seq analyses further indicate that the PhoQ*-PhoP* systems that we 
Fig. 3 | Identification of sets of orthogonal signalling proteins. a, Fold-
induction values for all possible pairings from a set of 79 PhoQ*-PhoP* variants.
The matrix was clustered in both dimensions. b, Number of unique sets of
various sizes of orthogonal PhoQ*-PhoP* pairs, with fold-induction values
>15 for cognate pairs and <6 for all non-cognate pairs. c, A set of five PhoQ* and
PhoP* variants that are functional and mutually orthogonal. d, A set of 9 PhoQ*
and PhoP* variants that are functional and mutually orthogonal, with fold-

identified are globally insulated from other pathways. Thus, there are 
unoccupied regions of sequence space, in which new systems with novel
interaction specificities can be introduced without producing crosstalk 
to existing systems. 

We also wanted to test the insulation of our selected functional PhoQ*-
PhoP* variants with respect to each other. To this end, we selected 79 vari-
ant pairs that had high fold-induction values and a broad sequence
diversity. We then combinatorially combined these PhoQ* and PhoP*
variants, which produced a library with a theoretical diversity of 5,609.
This library was transformed into cells containing the PmgrB-yfp reporter,
and was subjected to sort-seq (Fig. 1c, Extended Data Fig. 8a, b), which
enabled us to infer the fold induction of each combination of PhoQ*
and PhoP* variants. Within the resulting interaction matrix (Fig. 3a,
Supplementary Table 4), 58 variant pairs were orthogonal to the wild-
type proteins (Extended Data Fig. 8c).

To isolate orthogonal sets of PhoQ*-PhoP* variants, we searched the
79 × 79 interaction matrix for sub-matrices in which strong interactions
were seen only along the diagonal. We isolated more than 2,500 unique
sets of 5 orthogonal signalling pairs, and dozens of sets of 6 orthogonal
pairs (Fig. 3b, c, Extended Data Fig. 8d, e). With a slightly relaxed thresh-
old for noncognate interactions, we found sets with up to 9 orthogo-
nal protein pairs (Fig. 3d). To verify the orthogonality of these sets of
PhoQ*-PhoP* variants, we cloned and analysed the 25 individual pairs
that comprised a specific 5 × 5 matrix. Flow cytometry analysis showed
strong agreement with the fold-induction values that were inferred
using sort-seq (Fig. 3c, e).
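The search for sub-matrices with strong interactions only along the diagonal can be illustrated with a small backtracking routine. This is a sketch under stated assumptions, not the authors' code: it assumes a square matrix in which entry [i][j] is the fold induction of kinase i with regulator j and cognate pairs lie on the diagonal, and it borrows the thresholds quoted in the Fig. 3b legend (>15 for cognate pairs, <6 for noncognate pairs). The function name and the toy matrix are hypothetical.

```python
def find_orthogonal_sets(matrix, size, cognate_min=15.0, noncognate_max=6.0):
    """Enumerate index sets of `size` pairs that are mutually orthogonal.

    A set qualifies if every member has matrix[i][i] > cognate_min and every
    off-diagonal entry between members is < noncognate_max.
    """
    n = len(matrix)
    functional = [i for i in range(n) if matrix[i][i] > cognate_min]
    results = []

    def extend(chosen, start):
        if len(chosen) == size:
            results.append(tuple(chosen))
            return
        for idx in range(start, len(functional)):
            cand = functional[idx]
            # Check crosstalk in both directions against every chosen pair.
            if all(
                matrix[cand][m] < noncognate_max
                and matrix[m][cand] < noncognate_max
                for m in chosen
            ):
                chosen.append(cand)
                extend(chosen, idx + 1)
                chosen.pop()

    extend([], 0)
    return results


# Toy 4 x 4 matrix: pair 3 is non-functional, pairs 1 and 2 crosstalk.
toy = [
    [20, 2, 3, 1],
    [1, 18, 9, 2],
    [2, 3, 16, 1],
    [4, 2, 2, 5],
]
pairs_of_two = find_orthogonal_sets(toy, 2)
pairs_of_three = find_orthogonal_sets(toy, 3)
```

Exhaustive enumeration of this kind grows quickly with set size, which is consistent with the use of a different strategy for the largest sets.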

Notably, the noncognate pairs in Fig. 3 were measured in the absence 
of the cognate partner of each variant. Any weak crosstalk that is seen 
should be eliminated by the phosphatase activity of the cognate 
kinase; this prediction was confirmed for one instance of a PhoP*




induction values >17 for cognate pairs and <10 for non-cognate pairs. e, Fold-
induction values for the mutant combinations in c measured individually. f, As in
e but with a point mutant that reduces crosstalk (PhoP*(VHSCL) to
PhoP*(VYSCL)). g, Model for subfunctionalization of PhoQ in sequence space.
h, Subfunctionalization of PhoQ specificity. A set of three PhoQ*-PhoP* variants
that are mutually insulated but that retain interactions with the parent proteins.


variant that exhibited modest crosstalk from wild-type PhoQ unless its 
cognate PhoQ* was also expressed (Extended Data Fig. 8f, g). Thus, the 
off-diagonal values that are seen with the orthogonal sets in Fig. 3b-e 
represent upper limits on crosstalk. This small degree of crosstalk is also 
easily reduced further. For example, with the set in Fig. 3e, we screened 
point mutations of PhoP*(VHSCL) for reduced crosstalk, and found that 
PhoP*(VYSCL) had reduced interaction with noncognate kinases while 
maintaining the interaction with its cognate kinase PhoQ*(SCEHLI) 
(Fig. 3f). 

The 79 × 79 matrix of interactions (Fig. 3a) also offered insight into how
new pathways can arise through subfunctionalization (the partitioning
of a niche in sequence space rather than movement into a new region)
(Fig. 3g). For example, we found three pairs of PhoQ*-PhoP* variants that 
were insulated from each other, but each of which exhibited substantial 
crosstalk to the parental wild-type proteins (Fig. 3h, Extended Data 
Fig. 8h-k). Thus, these pairs have effectively partitioned the original 
niche of wild-type PhoQ (and PhoP) in sequence space to yield three 
insulated pathways. Subfunctionalization of duplicated proteins derived 
from a promiscuous ancestor may be a common mechanism through
which insulated paralogues arise during evolution. 

Collectively, our results indicate that the sequence space of two-com-
ponent signalling pathways in E. coli is relatively sparsely occupied, such
that new orthogonal signalling pathways can readily be introduced. 
The 502 functional variant pairs that we have isolated here have few 
specificity residues in common with wild-type PhoQ-PhoP, each other, 
or extant two-component signalling proteins (Extended Data Fig. 9a). A 
force-directed graph based on similarities in specificity residues high- 
lights the diversity of naturally occurring interfaces and of the variants 
that we isolated (Extended Data Fig. 9b). To estimate how easily a new
insulated pathway can be introduced, we noted that 502 functional 
Fig. 4 | Construction of an insulated sensor. a, An insulated cytokinin (trans-
zeatin) sensor (AQ,*), constructed by fusing the sensory domain of A. thaliana
AHK4 to PhoQ,’*. b, The chimeric sensor AQ," is specifically responsive to 1 μM
trans-zeatin, phosphorylating its cognate mutant PhoP,* to activate a YFP
reporter. Wild-type PhoQ (Q_WT) responds only to Mg²⁺ and not to the cytokinin.
Bars indicate mean from n = 3 biological replicates.


pairs came from 10,595 pairs with quantifiable fold-induction values. 
Of these 502, about 40% are probably insulated from wild-type PhoQ- 
PhoP (Fig. 2a), which implies that around 200 (1.6%) of the 10,595 pairs 
are both functional and insulated. This frequency is an upper bound, as 
the initial 10,595 pairs arose from selection under conditions of low Mg²⁺
(Fig. 1c), which enriches about 100-fold for functionality (Extended Data 
Fig. 2b). Nevertheless, given the size of sequence space, these estimates 
underscore the relative ease of creating kinase-substrate pairs that are
functional and orthogonal to their parent proteins. 

Orthogonal signalling pathways will be useful in generating synthetic
sensors and novel regulatory systems. As an example, we sought to
generate a new pathway in E. coli that responds to the cytokinin trans-
zeatin, a plant hormone. The histidine kinase AHK4 from Arabidopsis
thaliana senses trans-zeatin, but crosstalks extensively with a native two-
component pathway in E. coli. To overcome this limitation, we fused
the AHK4 sensory domain to the kinase domains of an orthogonal PhoQ*
and expressed this construct in E. coli with the cognate PhoP* (Fig. 4a,
b). This engineered sensor kinase enabled E. coli to respond specifically
to trans-zeatin, and was insulated from all native two-component path-
ways, as measured by phosphotransfer profiling and RNA-seq (Extended
Data Fig. 10). Thus, this chimeric sensor kinase and its cognate PhoP*
expand the sensory repertoire of E. coli without introducing undesir-
able crosstalk.

To date, synthetic circuits have mainly been built from nucleic-acid 
components because of their intrinsic modularity and programma-
bility. Protein-based circuits offer faster response times and richer
functionality, but require more complicated programming of protein 
interactions. Our work enables the design of two-component signalling- 
based circuits in bacteria or eukaryotes, and the relatively sparse distri- 
bution of paralogues in sequence space means that multiple pathways 
can readily be introduced. 

In summary, our work highlights the power of using coevolution- 
guided libraries to investigate protein-protein interactions and sup- 
ports a model in which sequence space is not densely occupied. The 
relatively sparse distribution of extant proteins in sequence space pre- 
sumably reflects their evolutionary history. A previous study indicated 
that duplicated signalling proteins are under pressure immediately after 
duplication to change and become insulated, but subsequent movement 
in sequence space then arises only from neutral changes. Although
duplicated proteins are initially subject to selection against crosstalk
with each other, each protein is probably not subject to system-wide 
negative selection or global optimization. 
Methods


Bacterial strains and media 

E. coli strains were grown in M9 medium (1× M9 salts, 100 μM CaCl₂, 0.2%
glucose, 0.1% casamino acids and MgSO₄ at indicated concentrations).
When indicated, antibiotics were used at the following concentrations:
carbenicillin 50 μg/ml, kanamycin 50 μg/ml, spectinomycin 50 μg/ml
and chloramphenicol 32 μg/ml.

The base strain for all studies was E. coli strain TIM175 (MG1655
ΔphoPQ ΔlacZYA attλ::[PmgrB-yfp] attHK::[PtetA-cfp+]) with a ColE1/ampR
plasmid (pCM150) containing PmgrB-yfp. All libraries were cloned onto
a low-copy pSC101/specR plasmid (pCMO099, a derivative of pLPQ2), in
which phoP-phoQ was driven by a constitutive lacUV5 promoter. We also
introduced a bicistron RBS (BCD18) upstream of phoP-phoQ, which
leads to expression of a single transcript that encodes a small (17 amino
acid) open reading frame, followed by an independent ribosome bind-
ing site and then phoP-phoQ. This configuration ensures that mutations
near the 5′ end of the phoP coding region do not substantially affect
expression by changing interactions between the 5′ end of phoP and the
upstream leader sequence. Expression from the lacUV5 promoter in a
plasmid probably produces more PhoQ-PhoP than is natively produced,
which increases the chance of crosstalk; thus, the variant pairs identi-
fied as orthogonal would perform even better with respect to crosstalk
at lower expression levels. Additional characterizations of PhoQ* and
PhoP* variants isolated from the library were done with a three-plasmid
setup: reporter plasmid pCM150, pCM143 (lacUV5-BCD18-phoP, pSC101/
specR) and pCM149 (lacUV5-RBS_B0034-phoQ, p15A/kanR). Point muta-
tions were introduced using blunt-end ligation and Gibson assembly.


Flow cytometry characterization 

To induce PhoQ-PhoP, cells were grown to mid-exponential phase (opti-
cal density at 600 nm (OD₆₀₀) of about 0.5) in M9 before being washed
once with M9 containing 0 mM MgSO₄ and diluted 1:100 into M9 con-
taining 10 μM MgSO₄ (for induction) or 50 mM MgSO₄ (for repression).
Cells were grown for 6 h, diluted 1:50 into PBS with 0.5 g/l kanamycin, and
fluorescence was measured on a Miltenyi MACSQuant VYB. An identical
procedure was used to induce AHK4-PhoQ fusions, except that cells
were grown in M9 containing 2 mM MgSO₄ and 1 nM anhydrotetracycline
(aTc) (Sigma) at all times, with the on condition containing 1 μM trans-
zeatin (Sigma) and the off condition containing no trans-zeatin. In each
cytometry experiment, three replicates of each sample were induced
independently and 20,000 cells were measured per replicate. FlowJo
was used to analyse the data, gating on single live cells and extracting
the geometric mean of the YFP distribution (Extended Data Fig. 2j). Error
bars indicate the s.d. of the geometric means measured in each replicate.
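The summary statistics described here can be reproduced with the Python standard library. The sketch below is illustrative only: it assumes that gated per-cell YFP values are already available as lists (one per replicate), and it assumes that fold induction is taken as the ratio of the mean geometric means in the on and off conditions, which the text does not state explicitly. Function names are hypothetical.

```python
from statistics import geometric_mean, mean, stdev


def summarize_replicates(replicates):
    """Return (mean, s.d.) of the per-replicate geometric means.

    replicates: list of lists of gated per-cell YFP values (all > 0).
    """
    gmeans = [geometric_mean(cells) for cells in replicates]
    return mean(gmeans), stdev(gmeans)


def cytometry_fold_induction(on_replicates, off_replicates):
    """Ratio of on-state to off-state signal (assumed definition)."""
    on_mean, _ = summarize_replicates(on_replicates)
    off_mean, _ = summarize_replicates(off_replicates)
    return on_mean / off_mean


# Hypothetical toy data: three replicates per condition
off = [[1, 100], [10, 10], [4, 25]]            # geometric mean 10 in each
on = [[100, 100], [50, 200], [100, 100]]       # geometric mean 100 in each
off_mean, off_sd = summarize_replicates(off)
fold = cytometry_fold_induction(on, off)
```

The geometric mean is the natural summary for fluorescence distributions that are approximately log-normal, because it corresponds to the arithmetic mean on a log scale.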


Design and assembly of the degenerate PhoQ-PhoP library 

The PhoQ-PhoP saturation mutation library was constructed by replac-
ing the targeted residues with NNS codons. The residues targeted
were selected on the basis of amino acid coevolution analyses performed
using GREMLIN.
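To make the consequences of NNS randomization concrete, the following sketch enumerates the NNS codons (N = any base, S = C or G) using the standard genetic code and counts what they encode. It also computes the theoretical diversity for 11 randomized positions, the number of interface residues targeted in this study. The variable names are illustrative and the code is not taken from the study.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def degenerate_codons(third_position_bases):
    """All codons with any base at positions 1 and 2 and a restricted third base."""
    return ["".join(c) for c in product("ACGT", "ACGT", third_position_bases)]


nns = degenerate_codons("CG")                      # S = C or G
encoded = {CODON_TABLE[c] for c in nns}
amino_acids = encoded - {"*"}
stop_codons = [c for c in nns if CODON_TABLE[c] == "*"]

positions = 11                                     # randomized interface residues
nucleotide_diversity = len(nns) ** positions       # distinct DNA sequences
protein_diversity = 20 ** positions                # distinct residue combinations

print(len(nns), "codons,", len(amino_acids), "amino acids, stops:", stop_codons)
print(f"{nucleotide_diversity:.2e} DNA variants, {protein_diversity:.2e} protein variants")
```

NNS covers all 20 amino acids with 32 codons and retains a single stop codon (TAG), which is why stop-codon-containing variants still appear in the library at low frequency.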

The plasmid library was assembled in two general steps: (1) individual
PhoP and PhoQ libraries were built in separate vectors and (2) sections of
these vectors were combined to produce a new vector containing both
the PhoP and the PhoQ mutant (Extended Data Fig. 2a). For the first step,
oligonucleotide libraries for the sections of phoP and phoQ to be mutated
were ordered from DNA 2.0. NNS nucleotides replaced codons 12, 14, 15,
18 and 19 in PhoP and codons 284, 288, 289, 292, 302 and 303 in PhoQ.
These oligonucleotides were cloned into vectors pCMO715 and pCM076
using the type IIS restriction enzyme BsmBI. A toxic ccdB locus on these
plasmids (used as a counter selection) was replaced during the process,
ensuring a high rate of insertion incorporation. Both insert and vector
were digested with BsmBI at 55 °C for 2 h and then purified on a Zymo
DNA clean column. Then, 1 pmol of both insert and vector was combined
in a 25-μl reaction with 400 units of T4 ligase and incubated at 16 °C
for 16 h. Three ligations of each library were done, to ensure sufficient
numbers of transformants. Ligations were dialysed on Millipore VSWP
0.025-μm membrane filters for 60 min and then the entire volume was
electroporated into 20 μl of Invitrogen MegaX DH10B cells. From three
ligations of each library, a total of 2.3 × 10⁸ and 7.4 × 10⁷ transformants
were obtained for the PhoQ and PhoP libraries, respectively. 

BsaI sites were used to join the two sublibraries into a single plasmid.
The fusion points were designed such that faulty assemblies would not
be viable: one junction was within the specR cassette (both the PhoP and
PhoQ sublibraries contained a kanR cassette, but each contained only one
half of the specR open reading frame) and the other was within PhoQ. In
brief, 500 fmol of miniprepped DNA product (Qiagen) from each of the
two libraries was combined in 25 μl T4 ligase buffer and digested with
1 μl BsaI for 1 h at 37 °C. T4 ligase was then added and the reaction was
cycled between 16 °C (3 min) and 37 °C (2 min) for 50 cycles to allow
iterative ligation and digestion, running the reaction to completion.
The final ligation product was dialysed on Millipore VSWP 0.025-μm
membrane filters for 60 min and the entire volume was electroporated
into 20 μl of Invitrogen MegaX DH10B cells. In total, 12 ligations and elec-
troporations were done to produce a total of 5.72 × 10⁸ transformants.
Transformations were pooled and grown overnight (14 h) in 100 ml 2×
YT + carbenicillin and spectinomycin. Following assembly, the plasmid
library was purified by miniprep (Qiagen), dialysed and electroporated
into ΔphoP-phoQ strain CJM2044 to yield 3.8 × 10⁸ transformants.


Library selection and sort-seq 

The PhoQ-PhoP library was subjected to an initial selection of Mg²⁺
starvation to enrich for functional variants before performing FACS. To
this end, 6 ml of an overnight culture of the library (in 2× YT) was washed
in M9 and diluted to an OD₆₀₀ of about 0.1 in 100 ml M9 containing 2 mM
MgSO₄. Three replicates of this culture were made, and carried separately
through the subsequent selection, FACS and deep-sequencing process.
Cells were grown for approximately 2 h to OD₆₀₀ of about 0.4, at which
point 1.6 ml of culture was washed 3 times in M9 containing no MgSO₄,
and used to inoculate 100 ml of M9 containing no MgSO₄. After each dilu-
tion, the culture was sampled and a dilution series was plated on LB plates
to ensure no bottlenecking occurred (colony-forming units >1 × 10⁷).
The cultures in M9 containing no MgSO₄ were grown overnight (14 h),
with the OD₆₀₀ increasing from 0.05 to only about 0.07. MgSO₄ was then
added to bring the concentration to 2 mM, and cells were grown to an
OD₆₀₀ of 0.5 in 6 h, at which point glycerol stocks were made.

For FACS, 1-ml glycerol stocks were thawed and inoculated in 25 ml of
M9. For each library replicate, 1 frozen stock aliquot was added directly
to M9 containing 50 mM MgSO₄ (off state) and 1 aliquot was washed 3
times in M9 containing 0 mM MgSO₄ before inoculation in M9 with 10 μM
MgSO₄ (on state). To maintain cells in exponential phase, cultures were
diluted (1:4 for on state, 1:10 for off state) after 3 h. After 6 h, cells were
diluted again (1:5), and chloramphenicol was added to a concentration
of 320 μg/ml and cells were placed on ice for sorting. CFP was expressed
at a low constitutive level (attHK::[PtetA-cfp]) and used to normalize YFP
expression. Cells were sorted into ratiometric bins on the diagonal of
CFP and YFP expression, to control for extrinsic expression noise in the
YFP signal. For each library replicate, both the on and off cultures were
sorted into 8 separate bins, generating 48 total bins. Up to 2.5 million
cells were sorted into bins per replicate (Extended Data Figs. 2c, 8a).
Sorted cells were added to 2× YT medium containing 2 mM MgSO₄,
carbenicillin and spectinomycin, and then grown overnight.


Illumina sample preparation 

After FACS, plasmids were purified (Qiagen MiniPrep) from overnight
cultures representing each bin from each library replicate. For the two
mutagenized regions of the plasmid to be brought into close enough
proximity (<790 bp) for paired-end Illumina sequencing (Extended
Data Fig. 2b), plasmids were digested with XhoI and then self-ligated (T4
ligase, 4 h). To isolate only self-ligation products and not cross-ligation
products, ligation reactions were cleaned (Zymo PCR Clean Up) and gel-
purified to select for the correct size on FlashGels (Lonza). Two PCR reac-
tions were performed, both using KAPA HiFi Hotstart, to add Illumina
sequencing adaptors and barcodes. First, ligation reaction products
were amplified for 30 cycles (95 °C for 30 s, 65 °C for 15 s and 72 °C for
120 s) with Illumina inner amplification primers (Supplementary Table
3) in an emulsion PCR (Micellula Emulsion PCR) to avoid PCR chimeras.
Second, purified PCR product from the first reaction was subjected to
a second PCR with barcoding primers for 9 cycles (95 °C for 30 s, 65 °C
for 15 s and 72 °C for 60 s). Final products were quantified (NanoDrop),
normalized, combined and sequenced on an Illumina NextSeq. For each
bin, 1-33 million reads were collected.


Construction of combinatorial mutant library 

Seventy-nine pairs of PhoQ*-PhoP* variants were selected that dis-
played broad sequence diversity and high fold induction in the initial
PhoQ*-PhoP* library. These pairs included 79 unique PhoQ* variants
and 71 unique PhoP* variants. These unique variants were cloned into
plasmids pCM143 (PhoP) and pCM149 (PhoQ) using blunt-end ligation
and Gibson assembly. Each pCM143 and pCM149 variant was amplified
by PCR (KAPA HiFi, 30 cycles) to generate amplicons for Gibson assembly
(primers CM937/CM1531 for pCM149 and primers CM938/CM1532 for
pCM143). PCR products were cleaned (Zymo PCR Cleanup), quantified
by NanoDrop and combined into an equimolar mix of pCM143 ampli-
cons and an equimolar mix of pCM149 amplicons. The two mixes were
combined in Gibson assembly master mix (300 fmol of large pCM143
fragment and 900 fmol of the smaller pCM149 insert), incubated at 50 °C
for 2 h and heat-killed at 79 °C for 20 min. The assembly was dialysed on
Millipore VSWP 0.025-μm membrane filters for 60 min and transformed
into electrocompetent CJM2044 cells.

Unlike the treatment of the initial larger library, this library was not 
subjected to a low-magnesium selection step. Immediately after con- 
struction, this library underwent sort-seq, as described in ‘Library selec- 
tion and sort-seq’. 


Illumina data processing 
The frequency of each mutant in each bin was calculated by taking the
fraction of reads in a given bin that corresponded to a given sequence,
normalized by the fraction of cells in that given bin (Supplementary
Information). All sort-seq plots display the mean frequencies in each
bin across three replicates, and error bars indicate the s.d. Gaussian
functions were fit to each distribution (in log₁₀(YFP units)), from both
the on and off sorts (SciPy optimize package). Variants with fewer than
25 total reads were discarded before fitting. Poor Gaussian fits have high
variances on the estimated parameters. The s.d. error on the estimated
log(YFP) mean (σ_fit) was used as a metric to filter poorly fit sequences:
sequences were removed if σ_fit,on + σ_fit,off > 2. In total, 10,595 unique vari-
ants passed these filters. Fold-induction values were calculated as the
ratio of the fit means between the induced and uninduced states: μ_on/μ_off.

During the analysis of the second, combinatorial library (Fig. 3a),
the fold induction was calculated for the most-frequent nucleotide
sequence that represented each amino acid sequence. For visualization
and orthogonal set design, fold inductions were bounded between 1 and
20. Individually tested mutants generally did not surpass the sensitiv-
ity of wild-type PhoQ-PhoP, which displayed 20-fold induction during
sort-seq; this suggests that signal above 20-fold may be due to noise.
The axes of the matrix in Fig. 3a were clustered hierarchically using the
WPGMC method (SciPy). During clustering, matrix entries that lacked
data were assigned the mean value of all other entries.
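The filtering and bounding rules in this section can be summarized in a short function. This is a sketch under explicit assumptions, not the authors' implementation: the Gaussian fits are taken as given (dictionaries with hypothetical keys `mean`, `mean_err` and `reads`), the 25-read minimum is assumed to apply to reads summed over the on and off sorts, and the fitted means are assumed to be in log₁₀ units and converted back to linear units before the ratio is taken.

```python
def clipped_fold_induction(fit_on, fit_off, min_reads=25, max_err_sum=2.0,
                           lower=1.0, upper=20.0):
    """Return the bounded fold induction, or None if the variant is filtered out.

    fit_on / fit_off: dicts with
      'mean'     : fitted mean in log10(YFP units)
      'mean_err' : s.d. error on that fitted mean
      'reads'    : number of reads supporting the fit
    """
    # Discard variants with too few reads (assumed: summed over both sorts).
    if fit_on["reads"] + fit_off["reads"] < min_reads:
        return None
    # Discard poorly fit variants: summed error on the two means exceeds 2.
    if fit_on["mean_err"] + fit_off["mean_err"] > max_err_sum:
        return None
    # Ratio of means in linear units (assumed conversion from log10).
    ratio = 10 ** (fit_on["mean"] - fit_off["mean"])
    # Bound between 1 and 20, as done for visualization and set design.
    return min(max(ratio, lower), upper)


good = clipped_fold_induction(
    {"mean": 3.0, "mean_err": 0.1, "reads": 100},
    {"mean": 2.0, "mean_err": 0.1, "reads": 100},
)
saturated = clipped_fold_induction(
    {"mean": 4.0, "mean_err": 0.1, "reads": 100},
    {"mean": 2.0, "mean_err": 0.1, "reads": 100},
)
too_few = clipped_fold_induction(
    {"mean": 3.0, "mean_err": 0.1, "reads": 10},
    {"mean": 2.0, "mean_err": 0.1, "reads": 10},
)
poor_fit = clipped_fold_induction(
    {"mean": 3.0, "mean_err": 1.5, "reads": 100},
    {"mean": 2.0, "mean_err": 1.0, "reads": 100},
)
```

Bounding at 20 reflects the observation above that the wild-type pair itself reaches about 20-fold induction in sort-seq, so larger inferred values carry little additional information.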


Analysis of the sensitivity of sort-seq quantification to read count 
To assess the quality of sort-seq-based quantification for variants with 
lower read coverage, variants with high read coverage (2,000-10,000 
total reads) were downsampled to simulate low read coverage and fed 
through the sort-seq analysis pipeline (Extended Data Fig. 3). For each of 


these high-coverage variants, simulated data were produced by down- 
sampling 100 independent times by the factors indicated in Extended 
Data Fig. 3. Simulated read coverage was generated by sampling (with 
replacement) from the original reads of each variant up to the desired 
read coverage. Simulated reads were then subjected to the same Gauss- 
ian fitting protocol as described in ‘Illumina data processing’. As in the 
original analysis, poor fits (σ_fit,on + σ_fit,off > 2) were discarded. All simulated
variants were classified as functional or non-functional, on the basis of 
the fold-induction values of the original high-coverage variants from 
which they were sampled (Extended Data Fig. 3c, e). False-positive rates 
(Extended Data Fig. 3d) and false-negative rates (Extended Data Fig. 3f) 
were then calculated as a function of read coverage by computing the 
fraction of simulated variants that were misclassified. 
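The downsampling procedure lends itself to a compact simulation. The sketch below is illustrative and makes several assumptions: reads are represented only by the bin they came from, resampling with replacement is implemented as weighted draws over bins, and the classification of each simulated variant as functional or non-functional is supplied by the caller. The function names and the fixed random seed are hypothetical choices for reproducibility.

```python
import random
from collections import Counter


def downsample(bin_reads, target_reads, n_trials=100, seed=0):
    """Simulate low coverage by resampling reads with replacement.

    bin_reads    : per-bin read counts of a high-coverage variant
    target_reads : desired total read count for each simulated data set
    Returns a list of n_trials per-bin count lists.
    """
    rng = random.Random(seed)
    bins = list(range(len(bin_reads)))
    trials = []
    for _ in range(n_trials):
        # Drawing bins weighted by read count is equivalent to sampling
        # individual reads with replacement.
        draws = rng.choices(bins, weights=bin_reads, k=target_reads)
        counts = Counter(draws)
        trials.append([counts.get(b, 0) for b in bins])
    return trials


def error_rates(truth, calls):
    """False-positive and false-negative rates for functional/non-functional calls.

    truth : True if the original high-coverage variant is functional
    calls : True if the simulated variant was classified as functional
    """
    negatives = [c for t, c in zip(truth, calls) if not t]
    positives = [c for t, c in zip(truth, calls) if t]
    fp = sum(negatives) / len(negatives) if negatives else 0.0
    fn = sum(not c for c in positives) / len(positives) if positives else 0.0
    return fp, fn


trials = downsample([0, 50, 950, 0], target_reads=40)
rates = error_rates([True, True, False, False], [True, False, True, False])
```

Each simulated count list would then be passed through the same fitting and filtering steps as the real data before being classified.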


Orthogonal set design 

Orthogonal sets of PhoQ* and PhoP* variants of up to seven pairs in size 
were identified by systematically scanning all PhoQ*-PhoP* permuta- 
tions within the matrix in Fig. 3a and Supplementary Table 4. Larger 
orthogonal sets were identified using a greedy search algorithm (Sup-
plementary Information). The 5 × 5 orthogonal set described in Fig. 3e
was further optimized by testing single-point mutants of the single PhoP* 
variant (VHSCL) that initially displayed crosstalk with a noncognate 
PhoQ* variant. Each of the five PhoP* specificity residues was replaced 
independently with an NNK codon to generate all possible single-point 
mutants (XHSCL, VXSCL, VHXCL, VHSXL and VHSCX, in which X is any 
amino acid specified by the NNK codon). These mutants were cotrans- 
formed with the cognate PhoQ* variant (SCEHLI) into CJM2044 and 
grown overnight in M9 medium containing 0 mM MgSO₄ to remove 
nonfunctional variants. After plating the surviving strains on LB agarose 
plates, 48 clones were tested for Mg²⁺ induction (see ‘Flow cytometry 
characterization’). The pCM143-PhoP* plasmids from the 24 clones 
with the strongest induction were purified and cotransformed with the 
noncognate PhoQ* variant (AGGCYF) into CJM2044. Mg²⁺ induction 
was measured by cytometry and the eight clones displaying the high- 
est cognate/noncognate induction ratio were selected for testing with 
all five PhoQ* variants. Two of these eight clones were PhoP*(VYSCL), 
which displayed the highest specificity. 
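
A systematic scan for small orthogonal sets can be sketched as below. The thresholds `on_min` and `off_max`, the dictionary layout of the fold-induction matrix and the conservative handling of missing entries are all assumptions made for illustration; the source does not specify them.

```python
from itertools import combinations

def is_orthogonal(pairs, fold, on_min, off_max):
    """True if every cognate pair signals and every noncognate pair does not."""
    for i, (q, _) in enumerate(pairs):
        for j, (_, p) in enumerate(pairs):
            value = fold.get((q, p))
            if value is None:
                return False  # missing measurement: treated conservatively
            if i == j and value < on_min:
                return False  # cognate pair too weak
            if i != j and value > off_max:
                return False  # crosstalk between noncognate partners
    return True

def find_orthogonal_sets(fold, size, on_min=20.0, off_max=2.0):
    """Exhaustively test all sets of `size` functional kinase-regulator pairs."""
    candidates = sorted(qp for qp, v in fold.items() if v >= on_min)
    found = []
    for combo in combinations(candidates, size):
        kinases = {q for q, _ in combo}
        regulators = {p for _, p in combo}
        if len(kinases) == size and len(regulators) == size \
                and is_orthogonal(combo, fold, on_min, off_max):
            found.append(combo)
    return found
```

The number of combinations grows rapidly with set size, which is consistent with the switch to a greedy search for larger sets.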


Reconstruction and in vivo characterization of individual PhoQ* 
and PhoP* variants 

Variants were cloned into plasmids pCM143 (PhoP) and pCM149 
(PhoQ) using blunt-end ligation?” and Gibson assembly”®. Combina- 
tions of pCM143 and pCM149 plasmids were cotransformed into strain 
CM2044. Colonies were grown overnight in M9 containing 2 mM MgSO₄ 
and induced as described in ‘Flow cytometry characterization’ with M9 
containing either 10 µM or 50 mM MgSO₄. After 6 h, cultures were diluted 
1:50 into cold PBS containing 0.5 g/l kanamycin, and fluorescence was 
measured on a Miltenyi MACSQuant VYB. The fold-induction values of 
individually tested variant pairs were generally smaller than those meas- 
ured by sort-seq, probably owing to differences between the Miltenyi 
cytometer and BD Aria sorter; however, the two measurements were 
highly correlated (Pearson R= 0.91) (Fig. 1d). 


Purification of two-component signalling proteins and in vitro 
phosphotransfer assays 

Expression and purification of PhoQ* and PhoP* variants, and phospho- 
transfer experiments, were carried out as previously described”. PhoP* 
was purified fused to a His₆-Trx tag, and the cytoplasmic region of PhoQ* 
(residues 238-486) was fused to a His₆-MBP (maltose-binding protein) tag. 
For phosphotransfer reactions, the kinase was autophosphorylated for 1 h at 
30 °C with [γ-³²P]ATP (Perkin Elmer) before being combined with PhoP* at a 
1:8 ratio (10-µl reactions contained 1 µM PhoQ* and 8 µM PhoP*). Reactions 
were stopped at appropriate times by adding 4× Laemmli buffer with 8% 
2-mercaptoethanol. This process enabled monitoring of both phosphotransfer 
and phosphatase activities between PhoQ* and PhoP* variants (Fig. 2b). 


For PhoQ* variants for which in vitro autophosphorylation was not 
observed, phosphatase activity was assayed by mixing a given PhoQ* 
variant with a PhoP* variant that was phosphorylated using wild-type 
PhoQ. This was achieved by incubating 8 µM PhoP* for 1 h at 30 °C with 
[γ-³²P]ATP and 1 µM wild-type PhoQ, which promiscuously phosphorylates 
(but does not dephosphorylate) most PhoP variants. After generating 
phosphorylated PhoP*, 1 µM of the PhoQ* variant was added, 
samples were taken and the reactions were stopped. 

Phosphatase activity of PhoQ* variants with respect to other response 
regulators was measured with a similar assay. Twelve E. coli response 
regulators were selected for their ability to be stably phosphorylated 
in vitro by a cocktail of six E. coli histidine kinases (CreC, RstA, PhoR, 
PhoP, EnvZ and CpxA, each at 250 nM). After 2 h of pre-incubation with 
radiolabelled ATP and this kinase cocktail, each regulator was combined 
with 2 µM PhoQ or PhoQ* variant. Reactions were stopped at 0, 60 and 
120 min by adding 4x Laemmli buffer with 8% 2-mercaptoethanol. 

Response regulators for phosphotransfer profiles were purified as 
done for PhoP*. Each was fused to an N-terminal His₆-Trx tag, expressed 
in BL21(DE3) cells and purified on a Ni²⁺-NTA column. Conditions for 
phosphotransfer profiles were also identical to the above conditions; 
10 µl reactions containing 1 µM [γ-³²P]-autophosphorylated PhoQ* and 
8 µM response regulator were generated and stopped after 5 or 60 min. 
Gel images were quantified with ImageJ. 


RNA-seq 

Cultures were grown overnight in M9 medium containing 2 mM MgSO₄ 
and then diluted 1:25 into fresh M9 containing 2 mM MgSO₄ and grown 
for 2 h to reach an OD₆₀₀ of about 0.5. For the off condition, 1 ml of cells was 
diluted into 2 ml M9 containing 74 mM MgSO₄ for a final concentration of 
50 mM MgSO₄. For the on condition, 2 ml of cells were pelleted, washed 
twice with M9 + 10 µM MgSO₄, resuspended in M9 + 10 µM MgSO₄, and 
then 1 ml of cells was diluted into 2 ml M9 + 10 µM MgSO₄. The induction 
of AQ,* strains was identical, except that cells were induced for 30 min 
in M9 (2 mM MgSO₄) containing either 0 µM trans-zeatin (off condition) 
or 1 µM trans-zeatin (on condition). 
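
As a check on the off-condition dilution, the final magnesium concentration follows from a volume-weighted average of the two solutions. The helper below is an illustrative sketch; its name and interface are not from the source.

```python
def mixed_concentration(parts):
    """Concentration after mixing (volume, concentration) parts.

    Volumes must share one unit and concentrations another.
    """
    total_volume = sum(volume for volume, _ in parts)
    return sum(volume * conc for volume, conc in parts) / total_volume

# Off condition: 1 ml of culture at 2 mM MgSO4 into 2 ml of M9 at 74 mM MgSO4.
off_mM = mixed_concentration([(1.0, 2.0), (2.0, 74.0)])
```

The result, (1 × 2 + 2 × 74) / 3 = 50 mM, matches the stated final concentration.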

RNA was collected as previously described”. After 30 min of growth, 
cells from each condition were collected by adding 1.8 ml culture to 200 
µl cold stop solution (95% ethanol, 5% acid buffered phenol, 4 °C). The 
mixture was centrifuged for 30 s at 13,000 r.p.m. on a benchtop centrifuge, 
and the supernatant was removed and the pellet flash-frozen in 
liquid nitrogen and stored at −80 °C. To extract RNA, TRIzol (Invitrogen) 
was heated to 65 °C, added directly to the pellet and incubated at 65 °C 
for 10 min with shaking at 2,000 r.p.m. (Eppendorf Thermomixer). The 
mixture was frozen at −80 °C for at least 10 min. After thawing, cells 
were centrifuged at 15,000 r.p.m., 4 °C for 5 min and the supernatant 
was removed into 400 µl ethanol. The mixture was applied to a Direct-zol 
spin column (Zymo) and centrifuged for 30 s at 13,000 r.p.m. The 
columns were washed with Direct-zol RNA prewash buffer twice (400 µl) 
and RNA wash buffer (700 µl) once before eluting in 90 µl DEPC water. 
Then 10 µl of 10× Turbo DNase buffer and 2 µl Turbo DNase (Invitrogen) 
were added to the eluant. The mixture was digested at 37 °C for 20 min, 
followed by the addition of 2 µl DNase and another 20 min incubation. 
The total volume was brought to 200 µl with DEPC water and combined 
with 200 µl acid-phenol:chloroform (IAA, Invitrogen), vortexed and 
centrifuged for 10 min at 21,000g and 4 °C. The top (aqueous) layer 
was extracted and the RNA was ethanol-precipitated with 20 µl NaOAc (3 M), 2 µl 
GlycoBlue (Invitrogen) and 600 µl cold ethanol. The precipitation mix was 
incubated at −80 °C for more than 4 h before centrifuging for 30 min 
at 21,000g and 4 °C. The pellet was washed twice with 500 µl cold 70% 
ethanol, then air-dried and resuspended in 50 µl DEPC water. RNA integ- 
rity was validated on a 6% TBE-urea acrylamide Novex gel (Invitrogen) 
and yield was quantified by NanoDrop spectrophotometer. Ribosomal 
RNA was removed with the RiboZero rRNA Removal Kit for Bacteria 
(Illumina). RNA was fragmented and cDNA libraries were prepared at the 
MIT BioMicro Center sequencing core using the KAPA RNA HyperPrep 
Kit (Roche) and sequenced on an Illumina HiSeq. Reads were mapped to 
the E. coli genome and plasmids with bowtie2 using default parameters. 

To determine whether selected PhoQ*-PhoP* variants interfered with 
other two-component signalling pathways, we examined whether any 
other response regulator or histidine kinase genes were upregulated 
transcriptionally when PhoQ* variants were activated by low levels of 
Mg²⁺. We calculated the fold change in expression of each two-component 
regulatory gene as a ratio of reads under conditions of low and 
high Mg²⁺ (Extended Data Fig. 7b). rstA and rstB are part of the PhoP 
regulon, and are directly upregulated by wild-type PhoQ-PhoP and 
most variant pairs. To quantify how these fold changes compared to 
wild-type PhoQ-PhoP, we calculated the ratio of the fold change in each 
gene to the geometric mean of the fold change in the same gene for the 
two wild-type replicates (Extended Data Fig. 7c). To assess whether any 
of the two-component signalling genes were significantly upregulated, 
we calculated the z-score of each ratio of fold changes and conducted a 
one-tailed test to compute P values (Extended Data Fig. 7d). After using 
a Bonferroni correction for multiple hypothesis testing, no genes were 
found to be significantly upregulated (P< 0.05). 
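
The z-score test and Bonferroni correction described above can be sketched as follows. Whether the ratios were log-transformed before computing z-scores, and whether a sample or population standard deviation was used, are not stated in the text; the choices below are assumptions for illustration only.

```python
from statistics import NormalDist, mean, stdev

def one_tailed_p_values(ratios):
    """Upper-tail P value of each ratio against the distribution of all ratios."""
    mu, sigma = mean(ratios), stdev(ratios)  # sample s.d. (assumption)
    standard_normal = NormalDist()
    return [1.0 - standard_normal.cdf((r - mu) / sigma) for r in ratios]

def bonferroni_significant(p_values, alpha=0.05):
    """Flag P values that pass a Bonferroni-corrected threshold alpha / m."""
    threshold = alpha / len(p_values)
    return [p < threshold for p in p_values]
```

The one-tailed form tests only for genes induced more strongly with a variant pair than with the wild-type pair, as in the analysis described.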


Identification of two-component signalling proteins and 
generation of force-directed graphs 

The RefSeq Prokaryotic Genomes database of 5,506 bacterial genomes 
(September 2017) was downloaded from NCBI. The database was scanned 
for histidine kinases and response regulators using jackhmmer® (e value 
cut-off = 0.01) with all two-component signalling proteins from E. coli 
used as queries. The combined lists of histidine kinase and response 
regulator hits were aligned with hmmalign® to the PFAM hidden Markov 
models for HisKA and Response Reg domain families, respectively. 
Columns in the multiple sequence alignment with greater than 80% 
gaps were eliminated, and sequences with greater than 50% gaps were 
discarded. Histidine kinases that lack the catalytic histidine and response 
regulators that lack the catalytic aspartate were removed. Proteins that 
contain both the histidine kinase DHp domain and a response regulator 
receiver domain were discarded to avoid ambiguity. Histidine kinases 
and response regulators were then labelled as exclusive pairs if they 
were (i) within 20 genes in the genome with no other histidine kinase or 
response regulator gene between, (ii) on the same strand and (iii) closer 
to no other potential histidine kinase or response regulator partner 
(with distance defined as the number of genes between partners). The 
sequences of paired histidine kinases and response regulators were 
concatenated and the multiple sequence alignment was then reduced 
to the 11 positions mutated in this study. 
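
The gap-filtering step of the alignment can be sketched as below, assuming the alignment is a list of equal-length strings with `-` as the gap character and that columns are filtered before sequences (the order in which the text lists the two steps).

```python
def filter_alignment(seqs, max_col_gap=0.8, max_seq_gap=0.5, gap="-"):
    """Drop gappy columns, then gappy sequences, from an aligned set of rows."""
    if not seqs:
        return []
    n = len(seqs)
    # Keep a column unless more than 80% of its entries are gaps.
    keep_cols = [
        j for j in range(len(seqs[0]))
        if sum(s[j] == gap for s in seqs) / n <= max_col_gap
    ]
    if not keep_cols:
        return []
    trimmed = ["".join(s[j] for j in keep_cols) for s in seqs]
    # Keep a sequence unless more than 50% of its remaining positions are gaps.
    return [s for s in trimmed if s.count(gap) / len(s) <= max_seq_gap]
```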

The force-directed graph was generated using the Gephi network 
visualization package. To construct a network, the 85,782 co-operonic 
histidine kinase-response regulator pairs identified by HMMER in bacte- 
rial genomes were combined with the functional mutant sequences from 
the PhoQ-PhoP dual library that had fold-induction values >18 and the 
mutant variants within the characterized 5 x 5 orthogonal set (Fig. 3c, e). 
These sequences were treated as nodes and were connected by edges 
if the pairwise alignment score for the 11 specificity residues (using the 
BLOSUM62 scoring matrix) exceeded a threshold score of 20. If more 
than 40 edges were connected to a node, only the top-scoring 40 edges 
were kept. If no edge scoring above 20 connected a node, that node 
retained its top-scoring edge, despite that edge being below the 
BLOSUM62 threshold. A final model of about 86,000 nodes and 2.5 million 
edges was loaded into Gephi and visualized using the Force-Atlas-2 tool. 
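
The edge rules above (threshold, cap of 40 edges per node, and a fallback edge for otherwise isolated nodes) can be sketched as follows. The `score` argument is a placeholder for a BLOSUM62-based pairwise score, and the treatment of an edge kept by only one of its two endpoints is an assumption; the all-pairs loop is illustrative and would need a faster implementation for about 86,000 nodes.

```python
from itertools import combinations

def build_edges(nodes, score, threshold=20, max_edges=40):
    """Return undirected edges (i, j) with i < j, following the rules above."""
    scored = {i: [] for i in range(len(nodes))}
    for i, j in combinations(range(len(nodes)), 2):
        s = score(nodes[i], nodes[j])
        scored[i].append((s, j))
        scored[j].append((s, i))
    edges = set()
    for i, neighbours in scored.items():
        neighbours.sort(reverse=True)
        above = [(s, j) for s, j in neighbours if s > threshold][:max_edges]
        # Fallback: keep the single best edge even if it is below threshold.
        chosen = above or neighbours[:1]
        for _, j in chosen:
            edges.add((min(i, j), max(i, j)))  # kept if either endpoint keeps it
    return edges
```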


Construction of AHK4-PhoQ chimaera 

Chimeric histidine kinase sensors were made using a variation of the 
‘primer aided truncation for the creation of hybrid proteins’ (PATCHY) 
strategy”. The N-terminal region of AHK4 (residues 1-475) was cloned 
downstream of the P,,- promoter on a p15A/kanR vector. This plasmid was 
amplified by PCR with primers containing SapI sites to enable insertion of 
the PhoQ kinase domain. Five sets of primers allowed five possible junction 
sites within AHK4 (residues A466, A468, A469, A472 and A478) with identical 
GCG overhangs. PhoQ (pCM149) was amplified with 32 distinct primers 
(also containing SapI sites) to generate 32 C-terminal truncations beginning 
upstream of the DHp domain (residues 213-224 and 257-276). PCR products 
were gel-purified (Zymo), then combined in a 50-µl ligation reaction 
containing 400 U of T4 ligase (NEB), 20 U SapI (NEB), 100 fmol pooled 
AHK4 PCR products and 500 fmol pooled PhoQ PCR products. The reaction 
was cycled 50 times between 37 °C (2 min) and 16 °C (3 min) to drive 
assembly to completion, heat-killed at 50 °C (20 min) and 80 °C (20 min) 
and dialysed on Millipore VSWP 0.025-µm membrane filters (60 min). This 
small library of 160 possible fusions was transformed into electrocompetent 
CJM2044 cells containing PhoP on a plasmid (pCM143). 
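
The stated library size can be checked from the numbers given: two inclusive residue ranges for the PhoQ truncation points and five AHK4 junction sites. The short sketch below only reproduces that arithmetic; the helper name is illustrative.

```python
def count_inclusive(ranges):
    """Number of residues covered by a list of inclusive (start, stop) ranges."""
    return sum(stop - start + 1 for start, stop in ranges)

ahk4_junctions = 5
phoq_truncations = count_inclusive([(213, 224), (257, 276)])  # 12 + 20
library_size = ahk4_junctions * phoq_truncations
```

This gives 32 truncations and 5 × 32 = 160 possible fusions, in agreement with the text.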

To enrich for chimeras responsive to trans-zeatin we used Mg²⁺ starvation 
as a selection. An overnight culture of the library in M9 medium was 
induced for 1 h (M9 containing 21 nM aTc, 1 µM trans-zeatin), washed 
three times in M9 containing no MgSO₄ and diluted 1:10 into 100 ml M9 
containing no MgSO₄, 21 nM aTc, 1 µM trans-zeatin. After 4 h, 500 µl of 
culture was plated on LB. Ninety-six colonies were picked and screened 
for trans-zeatin-dependent YFP expression. 


GCN4-DHp fusions to test phosphatase buffering against 
crosstalk in vivo 

To generate cytosolic variants of PhoQ locked in a phosphatase state, we 
followed a previously described strategy** and fused GCN4 (MKQLED- 
KVEELLSKNYHLENEVARL) N-terminal to the DHp domain of PhoQ. 
pCM149 (lacUV5-phoQ, kanR, p15A) was amplified with 24 distinct primers 
(containing SapI sites) to generate 24 C-terminal truncations beginning 
upstream of the DHp domain (residues 222-225 and 257-276). The 
N terminus of PhoQ was replaced by GCN4 in each of these plasmids, 
removing the transmembrane and sensory domains. Each GCN4 fusion 
plasmid was transformed with pCM143 (phoP) and pCM150 (Prers-YfP) 
into TIM175 and tested by standard Mg²⁺ induction (see ‘Flow cytometry 
characterization’) for activity. As expected, some variants displayed 
constitutive high YFP (presumably a locked kinase conformation) or 
constitutive low YFP (presumably a locked phosphatase conformation) 
and stepwise amino acid insertions displayed a periodicity of these 
phenotypes (Extended Data Fig. 8f). One of these fusions (fusion-266) 
displayed even lower constitutive YFP values than PhoQ with mutations 
in the ATP cap (R434M, R439M and Q442M; pCM180) or ATP pocket 
(N385L, N389L, K392M and Y393F; pCM179)”. 

To test the ability of a cognate phosphatase to suppress crosstalk from 
a noncognate kinase, we used a three-plasmid setup: pCM874 (reporter 
plasmid pCM150 with PlacUV5-PhoP,,* inserted), pCM149 (PlacUV5- 
PhoQ,,;) and pCM873 or pCM898 (Ptet-GCN4-fusion266-PhoQ,,, or 
-PhoQ,;*, respectively). Because PhoP* has been moved from a low-copy 
to medium-copy plasmid, crosstalk between PhoQy, and PhoP* is proba- 
bly exacerbated by overexpression, as noted by the high level of induction 
seen in Extended Data Fig. 8g before aTc is added. However, induction of 
the GCN4—fusion-266-PhoQ,,* phosphatase effectively eliminates this 
crosstalk (Extended Data Fig. 8g, right). Induction of the noncognate 
phosphatase GCN4—fusion-266-PhoQ,,;* does not relieve this crosstalk. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Datasets generated during this study have been deposited in GEO. Raw 
reads and processed sort-seq analysis of each mutant can be found under 
accession numbers GSE120780 (degenerate PhoQ-PhoP library) and 
GSE120786 (combinatorial library of 79 PhoQ*-PhoP* variants). Raw 
reads and reads per kilobase of transcript per million mapped reads 
(RPKM) for all E. coli genes from RNA-seq are deposited with accession 
number GSE128611. 


Code availability 


Python scripts for analysis are available at https://github.com/mcclune/nature2019. 

Extended Data Fig. 1 | Bioinformatic and coevolutionary analysis of two- 
component signalling systems. a, Schematic illustrating the challenge of 
identifying HK*-RR* pairs that are orthogonal to all endogenous histidine 
kinases and response regulators. For both histidine kinases and response 
regulators, the specificity-determining residues define a finite sequence space. 
The specificity-determining residues of each histidine kinase determine the set 
of response regulators with which it can interact. These sets, or niches in 
sequence space, are depicted as ovals, and each cognate response regulator is 
represented by a black dot (bottom left). A similar representation is shown for 
each response regulator and the set of histidine kinases with which it can 
interact (top right). The two sequence spaces are connected, as depicted with 
coloured cones for a single histidine kinase-response regulator pair. The 
establishment of a new signalling pathway that is orthogonal to existing systems 
requires that the two new proteins are compatible with each other, but occupy 
regions of histidine-kinase and response-regulator specificity space that are 
incompatible with all of the paralogues that are already present. b, Schematic 
summarizing the endogenous two-component pathways in E. coli to which a new 
orthogonal pathway must avoid crosstalk. c, Diagram of the DHp domain of the 
histidine kinase TM0853 (blue) in complex with its cognate response regulator 
RR0468 (green). Residues that dictate specificity, and which were randomized 
in our libraries, are space-filled in orange (kinase) and red (substrate). d, Plot 
summarizing the number of histidine kinases and response regulators in 
bacterial genomes. e, Visualization of the GREMLIN model, representing the 
coevolutionary dependencies between the residues of cognate histidine kinases 
and response regulators. Blue nodes indicate PhoQ residues, green nodes 
indicate PhoP residues and the darker nodes are the 11 residues that were 
randomized in the dual PhoQ-PhoP library. Edge widths indicate the strength of 
the coevolutionary signal, and the node size of each residue represents the total 
coevolutionary signal to residues on the other protein. 

Extended Data Fig. 2 | Summary statistics for the dual PhoQ-PhoP library. 

a, Schematic summary of library design. b, Left, histogram of the read counts for 
the pre-selection PhoQ-PhoP library. The vast majority of reads are unique, 
which indicates that the library size is larger than Illumina sequencing coverage 
and no variants are overrepresented. Right, histogram of the read counts for one 
replicate of the PhoQ-PhoP library after overnight growth in low Mg²⁺ 
conditions. c, Counts of cells sorted into each bin, for each replicate and growth 
condition. d, The cells sorted into each bin were grown overnight, diluted back 
to mid-exponential phase, shifted to medium with 10 µM Mg²⁺ and their YFP 
levels were verified by flow cytometry. n = 2 independent biological replicates. 
e, As in d, but with cells retained in medium with 50 mM Mg²⁺. n = 2 independent 
biological replicates. f, Scatter plots displaying the correlations between the bin 

10,595 variants with sufficient sequencing coverage and fit quality (Methods) to 
be included in the analysis. h, Scatter plot displaying the Pearson correlation 
between the YFP fold induction measured by sort-seq and that measured 
individually by flow cytometry for 32 individual variant pairs. i, Sequence logos 
summarizing the amino acid frequencies at each position varied in the pre- 
selected library (top), set of pairs with >20-fold induction (middle) and all native 

Extended Data Fig. 3 | Sensitivity of sort-seq pipeline to read coverage. 

a, Histogram of the total read counts for the 10,595 variants with sufficient 
coverage and fit quality to be included in analysis. b, A schematic example of the 
downsampling method used to simulate low read coverage using variants with 
high read coverage. c, Fold induction of high-coverage functional PhoQ*-PhoP* 
variants (fold induction >20) after simulating lower read coverage using 
downsampling and refitting. n=100 independent downsampling simulations. 
d, A quantification of how read coverage in c affects the calculated fold 

Extended Data Fig. 4 | Additional in vitro characterizations of selected 
PhoQ*-PhoP* pairs. a, Phosphotransfer reactions for PhoQ*-PhoP* variants, as 
well as PhoQ* with wild-type PhoP (middle) and wild-type PhoQ with PhoP* 
(right). These experiments were repeated independently twice with similar 
results. b, Quantification of phosphatase activity for four PhoQ* variants, 
as in Fig. 2c. Lines indicate mean ± s.d. from n = 3 biological replicates. 

in Fig. 2d, but for 60 min and 5 min instead of just 5 min. In each case, the kinase 
was autophosphorylated and then incubated for 5 or 60 min individually with 
each of 27 response regulators from E. coli, and its selected PhoP* variant, 
followed by SDS-PAGE and autoradiography. The position of the 
autophosphorylated kinase and the approximate positions of any 
phosphorylated regulators are indicated by arrowheads on the left. These 
experiments were repeated independently twice with similar results. 

was autophosphorylated, then incubated for 5 or 60 min with its cognate 
response regulator, with wild-type PhoP, or with one of eleven PhoP* variants, 
and analysed as in Extended Data Fig. 4. n = 1 independent experiment. 

Extended Data Fig. 7 | RNA-seq analysis of strains containing PhoQ*-PhoP* 
variants. a, RNA-seq analysis of strains containing wild-type PhoQ-PhoP or the 
indicated variant pair, measured after 30 min with 10 µM or 50 mM Mg²⁺. Each 
strain displays a similar Mg²⁺-limitation induction of three genes (mgtA, mgtL 
and mgrB) in the PhoP regulon, as well as the PhoP-dependent reporter gene yfp. 
b, The expression change of each response-regulator and histidine-kinase gene 
in E. coli. Colours represent the fold change (expressed in log₂) in the response to 
low extracellular Mg²⁺. rstA and rstB are part of the PhoP regulon, and so show 
changes in transcription after activation of PhoQ or of several PhoQ* variants. 
Otherwise, most two-component pathways show little induction by the wild- 
type PhoQ-PhoP and the PhoQ*-PhoP* pairs. c, The same data as in b, but with 
the fold change of each variant pair normalized to the fold change seen with wild- 
type PhoQ-PhoP, in which the latter is the geometric mean of two wild-type 
replicates. d, P values of the z-score calculated for each value in c. For each gene 
and each variant, z-scores represent the deviation of the variant/wild-type ratio 
of that gene, when compared to the distribution of the variant/wild-type ratio of 
every gene. Using all E. coli genes with reads across multiple samples (n = 3,477), 
P values were calculated with a one-tailed z-test to identify genes that induced 
more strongly with the variant pairs than with the wild-type pair. The statistical 
significance threshold after correcting for multiple hypothesis testing is 
indicated on the colour legend that encodes the P values. None of the other two- 
component signalling genes in E. coli is significantly induced by the variant 
PhoQ*-PhoP* pairs that we tested. 

combinatorial library in Fig. 3a. Correlations between bin frequencies of 
individual variant pairs measured in independent replicates. R² values indicate 

In cancer, recurrent somatic single-nucleotide variants, which are rare in most paediatric 
cancers, are confined largely to protein-coding genes. Here we report highly recurrent 
hotspot mutations (r.3A>G) of U1 spliceosomal small nuclear RNAs (snRNAs) in about 
50% of Sonic hedgehog (SHH) medulloblastomas. These mutations were not present 
across other subgroups of medulloblastoma, and we identified these hotspot mutations 
in U1 snRNA in only <0.1% of 2,442 cancers, across 36 other tumour types. The mutations 
occur in 97% of adults (subtype SHHδ) and 25% of adolescents (subtype SHHα) with SHH 
medulloblastoma, but are largely absent from SHH medulloblastoma in infants. The U1 
snRNA mutations occur in the 5’ splice-site binding region, and snRNA-mutant tumours 
have significantly disrupted RNA splicing and an excess of 5’ cryptic splicing events. 
Alternative splicing mediated by mutant U1 snRNA inactivates tumour-suppressor genes 
(PTCH1) and activates oncogenes (GLI2 and CCND2), and represents a target for therapy. 
These U1 snRNA mutations provide an example of highly recurrent and tissue-specific 
mutations of a non-protein-coding gene in cancer. 


Medulloblastoma, a cerebellar neuronal cancer, comprises four molecu- 
lar subgroups (WNT, SHH, group 3 and group 4), each of which has its 
own distinct clinical, transcriptomic and genetic make-up. Each of 
these four molecular subgroups can be further subdivided into sub- 
types; for SHH medulloblastoma, these comprise SHHα, SHHβ, SHHγ 
and SHHδ. Noncoding single-nucleotide variants have recently been 
discovered in the promoter regions of TERT and a handful of other loci, 
which has given impetus to the careful examination of noncoding 
segments. Thus, we sought to explore the genomic landscape of 
medulloblastoma with a particular focus on noncoding regions. We analysed 
whole-genome sequencing data of 114 medulloblastomas, and observed 
a recurrent hotspot mutation of noncoding U1 snRNA genes in 10 out of 
114 cases (8.8%) (Fig. 1a, Extended Data Fig. 1, Supplementary Tables 1, 
2, Methods). Hotspot mutations of U1 snRNA genes occur in the third 
nucleotide (r.3A>G), and are restricted to SHH medulloblastoma. These 
hotspot mutations are localized within the 5’ splice-site recognition 
sequence, which has been ultra-conserved in eukaryotes through nearly 
one billion years of evolution (Fig. 1b, Extended Data Fig. 2a). The human 
reference genome (hg19) has four annotated U1 snRNA genes (RNU1-1, 
RNU1-2, RNU1-3 and RNU1-4) and three ‘pseudogenes’ (RNU1-27P, 
RNU1-28P and RNVU1-18), all of which encode completely identical 
164-base-pair transcripts. In addition, there are over 100 U1 snRNA pseudogenes 
spread across the genome, which highly complicates their identification 
by mutation callers owing to the inability to align short reads to 
any individual U1 snRNA gene (Extended Data Fig. 3). We remapped 
sequence reads while permitting multimapping, and successfully detected 
the U1 snRNA mutation in five additional cases (Methods). We validated 
hotspot U1 snRNA mutations in an additional 40 out of 227 cases of 

A list of affiliations appears at the end of the paper. 

medulloblastoma from the International Cancer Genome Consortium 
(ICGC) (Supplementary Tables 2-4). We also detected recurrent hotspot 
mutations of the U11 snRNA gene (RNU11) at the fifth nucleotide (r.5A>G), 
in the highly conserved 5’ splice-site recognition sequence (in a total of 
4 out of 341 cases) (Extended Data Fig. 2b-d, Supplementary Table 2). 
Taken together, 51% (56 out of 109) of SHH medulloblastomas have at 
least one mutation in U1 or U11 snRNA (Fig. 2). These snRNA mutations 
significantly co-occur with mutations of the TERT promoter and DDX3X 
(Supplementary Tables 5, 6). We assessed the U1 snRNA r.3A>G muta- 
tions across 2,442 samples drawn from 36 cancer histologies of the 
ICGC, and found such a mutation in only one sample (0.04%)—a lone 
sample of pancreatic ductal adenocarcinoma (Supplementary Table 7). 
We conclude that U1 snRNA r.3A>G mutations are both highly recurrent 
in, and extremely specific to, SHH medulloblastoma. 




We validated the U1 snRNA r.3A>G mutations in an additional 159 
cases of SHH medulloblastoma using allele-specific PCR. We detected 
mutations in the RNU1-27P and/or RNU1-28P genes (which were con- 
firmed by Sanger sequencing) that were not identified by whole-genome 
sequencing (Extended Data Fig. 4a, b, Supplementary Table 8, Methods). 
Combining the results of whole-genome sequencing and allele-specific 
PCR, we found that U1 snRNA r.3A>G mutations were largely restricted 
to cases of SHH medulloblastoma in adulthood (SHHδ, present in 97% 
of cases) and adolescence (SHHα, present in 25% of cases), and absent 
from those in infancy (Fig. 3a, b). This remains true if accounting only 
for age, and not for molecular subtype. Indeed, most patients with SHHα 
with TP53 mutations also have U1 snRNA r.3A>G mutations (Fig. 3c). 
Both broad and focal somatic copy-number variations are divergent 
between SHHα with wild-type U1 snRNA, SHHα with mutant U1 snRNA 

Fig. 2 | Mutational repertoire of snRNA-mutant SHH medulloblastomas. 
Genomic landscape of mutations in SHH medulloblastomas (n = 109), with and 
without U1 or U11 mutations. Odds ratios (red dots) of coexistence of U1 and 
U11 snRNA mutations with other somatic events are shown with their 95% 
confidence intervals. Arrowheads represent values that are out of the axis range. 
Significantly correlated mutations are denoted in red (false discovery rate (FDR) 
< 0.1, asymptotic P values from odds-ratio tests (H₀: odds ratio = 1, Methods) with 
Benjamini-Hochberg adjustment for multiple testing). 


Fig. 3 | Clinical and cytogenetic features of SHH medulloblastomas with mutant 
U1 snRNA. a, Frequency of U1 snRNA mutations across subtypes of SHH 
medulloblastoma. NA, not available (samples for which the subtype is unknown). 
b, Top, frequency of U1 snRNA mutation by age group (n = 74 for infants, 
n = 53 for children, n = 32 for adolescents, n = 95 for adults). Bottom, age 
distribution by subtype (n = 47 for SHHα, n = 28 for SHHβ, n = 34 for SHHγ, 
n = 63 for SHHδ, n = 82 for unknown subtype) and U1 snRNA mutational status 
(n = 122 for mutant, n = 132 for wild type (WT)). P values were calculated by 
two-sided Wilcoxon rank-sum test. In the box plots, centre lines show data 
median; box limits indicate the interquartile range (IQR) from the 25th to the 
75th percentiles; and lower and upper whiskers extend 1.5× the IQR. Outliers are 
represented by individual points. c, Frequency of TP53 mutation in tumours with 
wild-type or mutant U1 snRNA. d-f, Progression-free survival of patients with 
SHHα, stratified by mutational status of U1 snRNA (d) (n = 10 for mutant, 
n = 27 for wild type), TP53 (e) (n = 15 for mutant, n = 22 for wild type) or both (f) 
(n = 9 for both mutant, n = 1 for U1 snRNA mutation only, n = 6 for TP53 mutation 
only, n = 21 for both wild type). P values were determined using the two-sided 
log-rank test. +, censored cases. 


and SHHδ with mutant U1 snRNA, which provides support for a model
in which these cancers follow different genetic pathways to transformation
(Extended Data Fig. 4c, d, Supplementary Tables 9, 10). An analysis
of focal copy-number variations demonstrates that SHHα tumours
with wild-type U1 snRNA have an increased incidence of copy-number
variations that encompass several oncogenes and tumour-suppressor
genes, including MYCN, CCND2 and PPM1D.

A univariate log-rank analysis of both progression-free survival and
overall survival revealed that, within SHHα tumours, both U1 snRNA
r.3A>G and TP53 mutational status are separately associated with a
significantly poor outcome (Fig. 3d-f, Extended Data Fig. 4e-i). However,
in a multivariate Cox regression analysis, TP53 mutations alone are no
longer significant for progression-free survival, whereas U1 snRNA
r.3A>G mutations confer a very strong risk of relapse (U1 snRNA r.3A>G
mutation: hazard ratio 5.51, 95% confidence interval 1.15-26.35, P = 0.03;
TP53 mutation: hazard ratio 3.01, 95% confidence interval 0.55-16.65,
P = 0.21). A similar trend was observed for overall survival (U1 snRNA
r.3A>G mutation: hazard ratio 3.72, 95% confidence interval 0.74-18.87,
P = 0.11; TP53 mutation: hazard ratio 2.70, 95% confidence interval
0.46-15.88, P = 0.27). This suggests that, within SHHα, the combination
of both a TP53 mutation and a U1 snRNA r.3A>G mutation is associated
with an extremely poor prognosis.
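
As a rough consistency check on the Cox regression figures above, the standard error of a log hazard ratio can be back-calculated from its 95% confidence interval and turned into a Wald P value. The sketch below is illustrative only: it assumes the reported intervals are symmetric Wald intervals on the log scale (the Methods do not state this), and the function name is hypothetical.

```python
import math

def wald_p_from_ci(hr, lower, upper, z_crit=1.959964):
    """Approximate two-sided Wald P value from a hazard ratio and its 95% CI.

    Assumes the interval is symmetric on the log scale (an assumption).
    """
    # Standard error of log(HR), recovered from the width of the log-scale CI.
    se = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    z = math.log(hr) / se
    # Two-sided tail probability of a standard normal distribution.
    return math.erfc(abs(z) / math.sqrt(2))

# Progression-free survival values quoted in the text.
print(round(wald_p_from_ci(5.51, 1.15, 26.35), 2))  # U1 snRNA r.3A>G
print(round(wald_p_from_ci(3.01, 0.55, 16.65), 2))  # TP53
```

Under that assumption, both values land close to the P values quoted in the text (0.03 and 0.21), which suggests the reported hazard ratios, intervals and P values are mutually consistent.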

Intron-centric alternative splicing analysis using LeafCutter confirms
that mutant U1 snRNA variants of both SHHα and SHHδ have
2.5-3 times more alternative 5’ cryptic splicing events than do SHH
medulloblastomas with wild-type U1 snRNA (Extended Data Figs. 5a, b,
6a-c, Supplementary Table 11). The U1 snRNA r.3A>G mutations would
be predicted to affect the recognition of the sixth intronic nucleotide
from the 5’ splice site; indeed, cryptic 5’ splice sites recognized in SHH
medulloblastoma with mutant U1 snRNA demonstrate enrichment of
a dominant C base, as opposed to the T base observed in tumours with
wild-type U1 snRNA (Extended Data Figs. 5c, 6d, e). Pathway analysis of
differentially expressed transcripts between SHH medulloblastoma
with wild-type versus mutant U1 snRNA demonstrates an increase in
nonsense-mediated decay, which is consistent with the destruction
of aberrantly spliced transcripts (Extended Data Fig. 7a). To validate
the effect of U1 snRNA mutations, we transfected wild-type or mutant
U1 snRNA r.3A>G vectors into human embryonic kidney 293T cells,
and examined the effects on splicing. Intron-centric analysis clearly
demonstrates an enrichment of a C base at the sixth intronic position
when using the mutant U1 snRNA vector, and a considerable increase
in the incidence of cryptic 5’ splicing events that do not overlap with
those of SHH medulloblastoma with wild-type U1 snRNA (Extended
Data Fig. 7b-d, Supplementary Tables 12, 13).
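
The sixth-intronic-base comparison described above amounts to tallying the nucleotide at intron position +6 across a set of 5’ splice sites. A minimal sketch follows, assuming each site is supplied as the intron sequence starting at its first nucleotide; the function name and the toy sequences are hypothetical and are not data from the study.

```python
from collections import Counter

def sixth_intronic_base_counts(intron_starts):
    """Count the base at intron position +6 (1-based) across 5' splice sites.

    Each input string is an intron sequence beginning at the first intronic
    nucleotide (normally the G of the GT dinucleotide).
    """
    counts = Counter()
    for seq in intron_starts:
        if len(seq) >= 6:  # skip sequences too short to have a +6 base
            counts[seq[5].upper()] += 1
    return counts

# Hypothetical toy sequences for illustration only.
toy_sites = ["GTAAGT", "GTGAGC", "GTAAGC", "gtaagt", "GTA"]
print(sixth_intronic_base_counts(toy_sites))
```

Comparing such tallies between the mutant and wild-type groups is the kind of input a χ² test on base composition would take.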

Clustering on the basis of significant alternative splicing events is 
clearly driven by U1 snRNA mutational status (Extended Data Fig. 7e, 
Methods), and tumours with mutant U1 snRNA segregated distinctly from 
those with wild-type U1 snRNA. We conclude that the U1 snRNA r.3A>G 
mutation has a marked effect on alternative splicing in affected tumours.

As a complementary approach, we conducted exon-centric alternative
splicing analysis using rMATS. We observed that SHH medulloblastoma
tumours with mutant U1 snRNA have a higher incidence of
cassette exons than do wild-type U1 snRNA controls (Extended Data
Figs. 8a-c, 9a, b, Supplementary Table 14). Similar to cryptic 5’ alternative
splicing events, the dominant base at the sixth intronic position is
C (Extended Data Figs. 8d, 9c, Supplementary Table 15). In addition,
an increase of retained introns is observed in tumours with mutant U1
snRNA. The 3’ splice-site sequences of missed splice sites in retained
introns do not have a dominant C at the sixth nucleotide, but instead
have the canonical T. This latter result suggests a mechanism in which
Fig. 4 | Aberrant splicing of Hedgehog signalling genes in SHH
medulloblastoma with mutant U1 snRNA. a, Overview of cryptic alternative
splicing of PTCH1, demonstrating the position of a cryptic cassette exon with the
5’ splice-site sequence. b, Top, Sashimi plots of splicing of PTCH1 in
representative cases. The bar plot shows counts per million reads (CPM).
Numbers refer to the number of junctional reads. Bottom, annotated exon
tracks with genomic positions marked. Junctional reads specific to U1 snRNA
mutants are in red. c, Scatter plot comparing detected alternatively spliced reads
and total junction reads that shared the 3’ splice site. Jittering was performed for
both values. d, Per cent spliced in values by SHHα with mutant U1 snRNA, SHHδ
with mutant U1 snRNA and SHH medulloblastoma with wild-type U1 snRNA (all


mutant U1 snRNA r.3A>G not only recognizes alternative 5’ splice sites
but also inhibits the wild-type U1 snRNA from detecting canonical splice
sites, which results in their aberrant splicing. The retained intron events
with the highest per cent spliced in, validated by real-time quantitative
(q)PCR, occur in the gene PAX6 (which undergoes frequent somatic
mutation in SHH medulloblastoma) and a chromatin remodelling gene
TOX4 (Extended Data Figs. 8e-h, 9d, Supplementary Table 16). The
retained introns in both genes result in a frameshift, which leads to loss
of function. These data may support a model in which U1 snRNA r.3A>G
mutations impede normal splicing, which leads to intron retention and
to an mRNA frameshift.

To detect pathogenic alternative splicing, we identified cryptic 5’ splicing
events with a C base at the sixth intronic position that are shared by
mutant U1 snRNA variants of both SHHα and SHHδ tumours (Extended
Data Fig. 9e, Supplementary Tables 17, 18). Using both RNA sequencing
and real-time qPCR, we detected cryptic splicing events with high
effect sizes in both PTCH1 and GLI2; these events were highly specific
to U1 snRNA r.3A>G mutation (in both SHHα and SHHδ tumours), as
compared to wild-type U1 snRNA controls (Fig. 4a-e). PTCH1 is known
to have at least three different initial exons. Splicing mediated by the
U1 snRNA r.3A>G mutant results in the inclusion of a cassette exon
between exon 2 and exon 3, which causes a frameshift, and therefore
predicted translation from the ATG in exon 3 (Fig. 4f). It has previously
been reported that loss of expression of the 1,447-amino-acid isoform
of PTCH1 results in derepression of Hedgehog signalling. Similarly, the
U1 snRNA r.3A>G cassette exon in GLI2 is spliced between exon 4 and
exon 5, which results in a putative GLI2 protein that lacks the repressor
domain (Extended Data Fig. 10a-f). Physiological GLI2 protein has a
repressor domain at its amino terminus, and constructs that lack the
subtypes of SHH medulloblastoma). P values were calculated using two-sided
Wilcoxon rank-sum test. e, Box plot of fold changes in expression of the
alternatively spliced isoform of PTCH1 compared to the wild-type isoform of
PTCH1 in subsets of SHH medulloblastoma, as determined by real-time qPCR.
Data are mean ± s.d. P values were calculated using two-sided Wilcoxon rank-sum
test. f, Illustration of canonical isoforms and the cryptic alternative isoform of
PTCH1. Putative translation start sites are indicated with an arrow. Resulting
proteins (and size) are displayed for each isoform. aa, amino acid; UTR,
untranslated region. In the box plots (d, e), the centre lines show data median; box
limits indicate the IQR from the 25th to the 75th percentiles; and lower and upper
whiskers extend 1.5x the IQR. Outliers are represented by individual points.


amino terminus are much more potent at activating Hedgehog signalling
than the full-length protein.

Alternative splicing of the cell-cycle gene CCND2, a known downstream
target of SHH signalling that is recurrently amplified in SHH
medulloblastoma, is detected in U1 snRNA r.3A>G mutants of SHHδ
but not in SHHα (Extended Data Fig. 10g-l). Focal amplifications of
CDK6 are highly recurrent in SHHα U1 snRNA r.3A>G mutants, but not
in SHHα with wild-type U1 snRNA or SHHδ U1 snRNA r.3A>G mutants,
which suggests convergence on dysregulation of the G1-S cell-cycle
checkpoint. The CCND2 alternative isoform is prematurely terminated,
which results in N-terminal sequences in which the PEST domain is
predicted to be deleted. Deletion of the PEST domain causes resistance
to protein degradation and impaired export from the nucleus, which
results in CCND2 accumulating in the nucleus to promote cell-cycle
progression. PAX5, another known tumour-suppressor gene, is affected
by cryptic 5’ alternative splicing in U1 snRNA r.3A>G mutants (Extended
Data Fig. 10m-q). SHH medulloblastomas with wild-type and mutant
U1 snRNA express distinct cryptic isoforms. The cryptic isoform present
in SHH medulloblastomas with wild-type U1 snRNA translates the
complete DNA-binding domain of PAX5. However, the cryptic exon (also
known as a poison exon) that is present in SHH medulloblastomas
with mutant U1 snRNA results in a stop codon before the DNA-binding
domain. Mutations of PAX5 in cancer are typically concentrated in the
DNA-binding site. Together, the data relating to the alternative splicing
of PTCH1, GLI2, CCND2 and PAX5 support a model in which cryptic
alternative splicing mediated by mutant U1 snRNA r.3A>G functions as
a driver in subsets of SHH medulloblastoma.

A U1 snRNA r.3A>G mutation is the most common single-nucleotide
variant in medulloblastoma. The restriction of these mutations not
just to SHH medulloblastoma but to the SHHα and SHHδ subtypes suggests
a model in which the specific cell of origin, the temporally specific
microenvironment or co-occurring mutations (that is, of TP53) are
necessary for U1 snRNA to contribute to oncogenesis. Although the almost
universal occurrence of U1 snRNA mutations in SHHδ highly supports
their role in tumour initiation, proof of the ongoing role of mutant U1
snRNA r.3A>G in tumour maintenance will await its knockdown in a
tumour in which it was the initiating genetic event.

Patients with SHHα with a U1 snRNA r.3A>G mutation are an extremely
high-risk population, who should be prioritized for the development
of targeted therapies. Drugs under development that directly target
the spliceosome may show anti-tumour effects in cancers
with spliceosomal mutations. Loss of expression of specific genes
through cryptic splicing or intron retention could create opportunities
for synthetic lethal approaches. Finally, cryptic splicing in SHH
medulloblastoma with mutant U1 snRNA leads to a unique form of
post-transcriptional hypermutation; this would be predicted to result in the
expression of numerous cell-surface neo-epitopes that are never seen in
healthy tissues, and which could be targeted using immunotherapies.
Methods


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Subjects and materials 

The study included two large cohorts of medulloblastomas, one from 
Toronto and the other from the ICGC (Extended Data Fig. 1). The Toronto 
cohort consisted of 293 cases (whole-genome sequencing, 114 cases; 
RNA sequencing (RNA-seq) 225 cases; 46 cases overlapped), collected 
at diagnosis after informed consent was obtained from subjects as part 
of the Medulloblastoma Advanced Genomics International Consortium. 
All patient recruitment and tumour-sample collection was approved 
and in compliance with the ethical regulations of each of the following 
institutions: The Hospital for Sick Children, Seoul National University 
Children’s Hospital, The Children’s Memorial Health Institute, Mayo 
Clinic, The Chinese University of Hong Kong, Johns Hopkins University
School of Medicine, Seattle Children’s Hospital, University of California 
San Francisco, McMaster University, Erasmus University Medical Center, 
Kitasato University School of Medicine, Fondazione IRCCS Istituto Nazi- 
onale Tumori, Emory University, Osaka National Hospital, Washington 
University School of Medicine, University of Calgary, Children’s Hospital 
of Pittsburgh, Hospital Pediatria Centro Médico Nacional Century XXI,
University of Debrecen, McGill University, Vanderbilt Medical Center, 
University of Colorado Denver, Istituto Giannina Gaslini and Université 
de Lyon. The whole-genome sequences consisted of 109 published and
5 unpublished cases (WNT, n = 2; SHH, n = 37; group 3, n = 26; and group 4,
n = 49). Samples were obtained as freshly frozen tissue from the time of
diagnosis and stored at −80 °C until processed for the purification of
nucleic acids. Genomic DNA was isolated by incubation with proteinase K
overnight at 55 °C followed by 3 sequential phenol extractions
and ethanol precipitation. Messenger RNA library construction and
sequencing were performed as previously described. The ICGC cohort
consisted of 227 cases, which were downloaded from ICGC under acces- 
sion DACO-1036229. 


Whole-genome sequencing 

Whole-genome sequencing was performed at Canada’s Michael Smith 
Genome Science Centre at the BC Cancer Agency, using the Illumina 
HiSeq 2000/2500 platform as previously described.


Sequence alignment of whole-genome sequencing data 

Whole-genome sequencing reads were aligned to the human reference
genome ‘hs37d5’ by 1000 Genomes Project Phase II, using the
Burrows-Wheeler aligner (BWA-MEM) version 0.7.8 with the ‘-T 0’ parameter.
Duplicates were marked using biobambam v.0.0.148. Sequencing coverages
were calculated using GenomonQC software, which was downloaded
from Genomon Project, and are shown in Supplementary Table 1.


Somatic variant calling 

Somatic variants were called using eight variant callers: MuTect2,
EBCall, Varscan2, Strelka, SomaticSniper, Virmid, Platypus
and Seurat.

MuTect2 was run using GATK v.3.5.0 with the default setting. Can- 
didate variants were discarded if the variants were detected in a panel 
of normal that was made by MuTect2 with ‘-artefact_detection mode’ 
and GATK ‘CombineVariants’ function with ‘-minN 2’. EBCall v.0.2.1 
was run with the default setting. We used the following criteria: requir- 
ing P value (by EBCall) < 10°, variant reads in tumour > 2 and variant 
reads in normal <1. Varscan2 v.2.4.3 was run with parameters ‘-strand-filter 1
-min-var-freq 0.08’. The results were filtered using the ‘fpfilter’
function with the option ‘-dream3-settings’. Strelka v.1.0.15 was run 
with default parameters. Virmid v.1.1.0 was run with the option ‘-q 10’. 
SomaticSniper v.1.0.5.0 was run with the parameters ‘-Q 15 -q 1 -G -L’
and the results were filtered by the author’s recommended filter using 
bam-readcount. The candidates with more than 0.03 of variant allele 
frequency in the matched normal sample were discarded. Platypus 
v.0.8.1 was run with a default setting. Detected variants that passed the 
standard Platypus filtering criteria or showed ‘allele bias’ were used. We 
used the following additional criteria: requiring likelihood(reference 
allele)/likelihood(variant allele) <10~ in tumour, likelihood(variant 
allele)/likelihood(reference allele) < 10° in matched control, variant 
reads in tumour >2 and variant reads in normal <1. Seurat v.2.5 was run 
with the option ‘-indels’. We used variants that were called by at least two
callers. Obtained results were filtered by <2 variant reads in the matched
normal control, calculated using the realignment function of Genomon- 
MutationFilter v.0.2.1. Variants were annotated using ANNOVAR. The
correlations of U1 and U11 snRNA mutations with other somatic events
were analysed using R package ‘Epi’ v.2.30. Asymptotic P values from 
odds-ratio tests were calculated using the twoby2 function, followed 
by Benjamini-Hochberg adjustment for multiple testing. 
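
For readers unfamiliar with the multiple-testing step mentioned above, the Benjamini-Hochberg adjustment can be sketched in a few lines. This is a generic illustration of the procedure (the same adjustment that R's `p.adjust` computes with `method = "BH"`), not the authors' code; the Methods do not say which function was used, and the function name here is hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(p_values)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted

# Hypothetical P values for illustration only.
print(benjamini_hochberg([0.01, 0.04, 0.03, 0.005]))
```

Each raw P value is scaled by the number of tests divided by its rank, and the running minimum keeps the adjusted values non-decreasing with rank.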


Copy-number calling for whole-genome sequencing 
Copy-number alterations were detected using Control-FREEC v.10.3 
with the following parameters: breakPointType = 4, ploidy = ‘2,3,4’,
step = 10000, window = 50000 (ref. 34).


Variant calling of U1 and U11 snRNA genes 

To explore mutations in low-mappability regions, we first picked up 
reads from whole-genome sequencing data on U1 and U11 snRNA genes
and pseudogenes using samtools and biobambam. To accept multimapping
reads, we used the STAR aligner. To prevent gaps, we used the setting
‘-scoreGap -20 -alignEndsType EndToEnd’. Mutations were called by
EBCall with the same setting as for whole-genome sequencing, except 
for the acceptance of secondary alignment. We used the following cri- 
teria: requiring P value (by EBCall) < 10°, variant reads in tumour > 4 
and variant reads in matched control <1. 

To evaluate the exact loci of variant reads and multiple mutations 
of U1 snRNAs, we mapped variant reads to case-specific references. 
First, we extracted all variant reads of U1 snRNA mutations (r.3A>G) 
with mate-paired reads. Then, we constructed a case-specific reference 
that included U1 snRNA hotspot mutations (r.3A>G) and case-specific 
germline variants detected from extracted variant reads, using the 
samtools mpileup function. Variant reads were mapped again on the 
case-specific reference, using bwa-mem with the same setting as was 
used in the whole-genome sequencing analysis. Using .bam files with 
the case-specific reference, we called variants in flanking regions of the 
U1 snRNA hotspot mutation (r.3A>G), using samtools mpileup function
to evaluate multiple mutations. No samples had recurrent variant reads. 
Therefore, we concluded that U1 snRNA mutation occurs in one allele. 
To interpret the mutated genes, we extracted consecutive consensus 
sequences of upstream U1 snRNA sequences with two or more supported 
reads. Then, the consensus sequence was mapped using BLAST software 
to U1snRNA genes and pseudogenes with 1,000-bp upstream sequences 
from the hg19 reference. Because there were many variants and a high 
level of similarity in the upstream sequences, we cannot detect the exact 
positions of mutated reads except for RNVU1-18 mutations. Therefore,
we classified U1 snRNA mutations into (1) RNU1 genes (RNU1-1, RNU1-2,
RNU1-3 or RNU1-4), (2) RNVU1-18 and (3) RNU1 pseudogenes (RNU1-27P
or RNU1-28P) on the basis of the similarity of sequences of the flanking 
region. Finally, we performed a manual review of detected mutations 
with the Integrative Genomics Viewer. Detected mutations are shown
in Supplementary Tables 2-4. 


Secondary structure of U1 and U11 snRNAs 

The conservation scores of U1 (RF00003) and U11 (RF00548) snRNAs
were downloaded from Rfam. U1 and U11 sequences of other species
were downloaded from seed sequences from Rfam. The secondary structures
are drawn on the basis of the consensus structure in Rfam using
VARNA software. U2-type intron and U12-type intron sequences were
downloaded from SpliceRack.


rhAmp genotyping 

Genomic DNA from primary tumours was tested using custom rhAmp
single-nucleotide polymorphism (SNP) assays (Integrated DNA Technologies).
In brief, locus- and allele-specific primers were generated individually
for RNU1_batch (RNU1-1, RNU1-2, RNU1-3, RNU1-4 and RNVU1-18) and
RNU1 pseudo (RNU1-27P and RNU1-28P). Assays were run in technical
triplicate in 5 µl volume (DNA concentration was at least 5 ng/µl), with
control gBlocks for wild-type, mutant and heterozygous genotypes.
The reporter mix used Yakima Yellow (mutant) and FAM (wild-type) 
dyes, as well as ROX dye as a passive reference. Plates were read on the 
StepOnePlus (Applied Biosystems) RT-PCR machine, and genotypes 
were called using the StepOne v.2.3 software. The primer sequences 
are available in Supplementary Table 19. 


RNA-seq 

Sequencing reads were mapped by STAR v.2.5.1b on a fasta file that includes
the human reference genome hs37d5 by 1000 Genomes Project Phase
II, spike-in sequences of profile C1_2 ERCC spike-in concentrations
used for C1 fluidigm and Caltech profile 3 spike-ins by Encyclopedia of
DNA elements (ENCODE), with the options ‘-outFilterMultimapNmax
20 -alignSJoverhangMin 8 -alignMatesGapMax 200000 -alignIntronMax
200000 -alignSJDBoverhangMin 10 -alignSJstitchMismatchNmax 5 -1
5 5 -outSAMmultNmax 20 -twopassMode Basic’. Mapping results are
shown in Supplementary Table 20. 


Intron-centric alternative splicing analysis 
Intron-centric alternative splicing analysis was performed using
LeafCutter. LeafCutter is an annotation-free quantification method. Intron
clustering was run with minimum required reads = 50 and max_intron =
500,000. LeafCutter was run with the option ‘-g 0’. Each of the 30 cases
of each subtype of SHH medulloblastoma was compared with samples
of the other subtypes, 5 adult brains and 4 fetal brains, using the default
settings. Cases of SHHα with U1 snRNA r.3A>G mutation (n = 13) were
compared with cases of SHHα with wild-type U1 snRNA (n = 39). The
results we obtained were filtered by the q value of each cluster < 0.01, in
cases in which at least one absolute effect size calculated by LeafCutter
was more than 1.5. Each event was annotated by LeafViz with GENCODE
v.19 .gtf file. Events with unknown strand directions were not analysed.
Logo sequences were built using R package ‘ggseqlogo’ v.0.1.
Statistical analyses comparing sequences were performed by χ² test. The
adjusted standardized residual was calculated by Haberman’s method.
We selected cryptic 5’ splicing events with a C base at the sixth base in
the intron. Subsequently, we further prioritized alternatively spliced
genes that have previously been reported as recurrent genetic aberrations
in SHH medulloblastoma, are transcriptionally upregulated or
downregulated in both the SHHα and SHHδ subtypes, or registered as
tier 1 in Cancer Gene Census.
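
The cluster-level filter described above combines two thresholds, which can be written as a single predicate. The sketch below only illustrates that logic; the function and argument names are hypothetical and do not correspond to LeafCutter's own output fields.

```python
def keep_cluster(q_value, effect_sizes, q_max=0.01, min_abs_effect=1.5):
    """True if a cluster passes both thresholds stated in the Methods:
    q value below q_max, and at least one intron in the cluster with an
    absolute effect size above min_abs_effect."""
    return q_value < q_max and any(abs(e) > min_abs_effect for e in effect_sizes)

# Hypothetical clusters for illustration only.
print(keep_cluster(0.005, [0.2, -1.8]))  # passes both thresholds
print(keep_cluster(0.005, [0.2, 1.0]))   # effect sizes too small
print(keep_cluster(0.05, [2.0]))         # q value too large
```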

t-distributed stochastic neighbour embedding (t-SNE) analysis was
performed using the R package ‘Rtsne’ v.0.13. Analysed events were
chosen using the following criteria: (1) significant events in at least one
SHH subtype; and (2) lengths of the cluster of junction reads are the same
among all subtypes. Per cent spliced in was calculated as the number of
junction reads of alternative splicing events, divided by the total number
of junction reads in a cluster. t-SNE was run with a default setting along
with 3 WNT, 20 group 3 and 22 group 4 medulloblastomas, which were
used in a previous study.
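
The per cent spliced in (PSI) definition given above is a simple ratio of junction-read counts. A minimal sketch follows, assuming the counts for every junction in a cluster are available as a list; the function name and example counts are hypothetical.

```python
def percent_spliced_in(event_reads, cluster_reads):
    """PSI = junction reads for the event / all junction reads in the cluster."""
    total = sum(cluster_reads)
    if total == 0:
        return None  # undefined when the cluster has no junction reads
    return event_reads / total

# Hypothetical cluster with two junctions: 12 reads for the alternative
# event and 36 reads for the other junction.
print(percent_spliced_in(12, [12, 36]))
```

Returning `None` for an empty cluster is a design choice for this sketch; a real pipeline might instead drop such clusters before computing PSI.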


Exon-centric alternative splicing analysis 

Exon-centric alternative splicing analysis was performed using rMATS
v.4.0.1. rMATS was run with default settings with GENCODE v.19 for
alternative 3’ splice site, alternative 5’ splice site, retained intron and
skipped exon. We filtered the events with FDR < 0.01 and change of
splicing inclusion (calculated by rMATS) > 0.05. The Sashimi plots were
drawn using MISO v.0.5.4.


Gene-set enrichment analysis of nonsense-mediated decay 

We counted reads using GENCODE v.19 .gtf file and htseq v.0.6.0 with the 
setting ‘-stranded reverse -m union’. Differential expression analysis was 
performed using DESeq2 v.1.16.1 with the default setting, after extract- 
ing genes expressed at >5 counts per million in at least 20% of cases. 
We performed two comparisons: SHHδ with mutant U1 snRNA (n = 30)
versus other SHH subtypes with wild-type U1 snRNA (n = 90) and SHHα
with mutant U1 snRNA (n = 13) versus SHHα with wild-type U1 snRNA
(n = 39). Gene-set enrichment analysis (GSEA) for differentially expressed
genes was performed using pre-ranked gene lists ordered by −log10(P value)
multiplied by +1 for upregulation or −1 for downregulation with
gsea v.3.0. We used two datasets for a pathway of nonsense-mediated 
decay, ‘GO nuclear transcribed mrna catabolic process nonsense medi- 
ated decay’ from the C5 gene set and ‘Reactome nonsense mediated 
decay enhanced by the exon junction complex’ from the C2 gene set. 
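
The pre-ranking metric described above (a signed −log10 P value) can be sketched as follows. This is an illustration under stated assumptions: the direction of change is taken from the sign of a log2 fold change, a fold change of exactly zero is treated as downregulation, and the gene names are hypothetical placeholders.

```python
import math

def rank_metric(p_value, log2_fold_change):
    """Signed -log10(P): positive for upregulation, negative otherwise."""
    sign = 1.0 if log2_fold_change > 0 else -1.0
    return sign * -math.log10(p_value)

def preranked(genes):
    """genes: iterable of (name, p_value, log2_fold_change).

    Returns (name, metric) pairs sorted from most upregulated to most
    downregulated, the order a pre-ranked enrichment analysis expects.
    """
    scored = [(name, rank_metric(p, lfc)) for name, p, lfc in genes]
    return sorted(scored, key=lambda item: item[1], reverse=True)

# Hypothetical differential-expression results for illustration only.
toy = [("GENE_A", 1e-4, 2.0), ("GENE_B", 1e-2, -1.0), ("GENE_C", 1e-3, 0.5)]
for name, metric in preranked(toy):
    print(name, round(metric, 2))
```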


TP53 mutation status 

Germline mutations of TP53 were analysed using EBCall v.0.2.1. EBCall
was run with the default setting. We used the following criteria: requiring 
Pvalue (by EBCall) < 10°, 90% posterior quantile calculated by EBCall > 
0.3. The results were annotated using ANNOVAR. 

The mutation call from RNA-seq was run using GATK v.3.8.0. Adding 
read groups and flagging duplicate reads were performed using Picard 
tool v.2.18.0. Then, we split reads into exon segments using GATK with 
the setting ‘-rf ReassignOneMappingQuality -RMQF 255 -RMQT 60 -U
ALLOW_N_CIGAR_READS’. Base recalibration was performed using GATK.
Mutation call was performed using the ‘HaplotypeCaller’ function of 
GATK with the setting ‘-dontUseSoftClippedBases -stand_call_conf 
20.0’. Variants were filtered using the ‘VariantFiltration’ function of 
GATK with the setting ‘-window 35 -cluster 3 -filterName FS -filter “FS
> 30.0” -filterName QD -filter “QD < 2.0”’. The variants were discarded
if they were also detected in any RNA-seq data generated from nine 
normal brain samples (five adult brains and four fetal brains). Sanger 
sequencing was performed as in a previous study. We discarded the
mutations that showed a frequency of 0.01 or higher in 1000 Genomes
v5b or ESP-6500, or dbSNP138.


Survival analysis 

Overall survival and progression-free survival were evaluated using the 
log-rank test with R package ‘survival’ v.2.40.1. Overall survival was defined
as the time from date of surgery to death or date of last follow-up, and 
progression-free survival as the time from date of surgery to first event 
(progression or relapse) or date of last follow-up. 


Pan-cancer analysis 

We analysed 2,442 cases of cancer across 36 tumour types from ICGC. 
The hotspot mutations were analysed with the method described in 
‘Variant calling of U1 and U11 snRNA genes’, except for the use of the mapping
tool. For pan-cancer data, we used bowtie aligner instead of STAR.


SNP6 copy-number analysis 

Array files were downloaded from the Gene Expression Omnibus (GEO) 
under accession number GSE37385, and the relevant Affymetrix SNP6 
arrays were extracted. Affymetrix Power Tools v.1.18.2 was used to pro- 
cess and normalize the probe intensities to generate log R ratio (LRR) 
and B-allele frequency (BAF) using the PennCNV-Affy pipeline. The
affygw6.hg19.pfb file was used to map the probes onto the hg19 genome.
All other parameters were left as default. 

The resulting probe-level LRR and BAF values were taken into ASCAT 
v.2.4.3. GC wave correction was then performed, followed by predicting
germline genotypes; this finally led to running the ASCAT algorithm to 
determine the copy-number values for each genomic region, as well as
the overall ploidy and purity of the sample. Samples with a model fit that
was less than 80% failed their ASCAT processing stage. Log ratios for each
segment were calculated by using the copy number of each segment,
as well as the average ploidy of the sample, according to the equation:

$$\text{ratio} = \log\left(\frac{\text{copy number}}{\text{ploidy}}\right)$$

Adjacent segments with log ratios that differed by less than 0.25 were
then merged using their size-weighted mean:

$$\text{new ratio} = \frac{\text{length}_1 \times \text{ratio}_1 + \text{length}_2 \times \text{ratio}_2}{\text{length}_1 + \text{length}_2}$$

Copy-number states were assigned to each segment on the basis of
their log ratio and their ploidy values, according to Supplementary
Table 21. Broad copy-number changes are defined as occurring in 75%
or more of a chromosome arm in size. Focal copy-number variants were
analysed using GISTIC v.2.0.23. GISTIC was run with the settings ‘-ta
0.25 -td 0.3 -js 10 -brlen 0.7 -gcm “extreme” -armpeel’.


PCR with reverse transcription and qPCR analysis 

RNA was obtained for samples from 18 patients that had fragments per 
kilobase of transcript per million mapped reads (FPKM) values of more 
than 2 for targeted genes from our larger cohort (6 SHHα with wild-type
U1 snRNA, 6 SHHα with mutant U1 snRNA and 6 SHHδ with mutant U1
snRNA). cDNA was synthesized using SuperScript III (Thermo Fisher
18080400). PCRs were performed with cDNA and Taq polymerase using
35 cycles, and products run on a 2% agarose gel. qPCRs were performed
using SYBR-Green with ROX (Thermo Fisher 11744500), two steps at
35 cycles. Calculation of ΔΔCt was done comparing the expression of
the mutant isoform to the wild-type isoform. The primer sequences are 
available in Supplementary Table 19. 
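
A common way to turn ΔΔCt into a fold change is the 2^(−ΔΔCt) calculation, sketched below. The Methods state only that the mutant isoform was compared with the wild-type isoform, so the choice of a control sample, the assumption of roughly 100% amplification efficiency, and all names and Ct values here are hypothetical.

```python
def fold_change(ct_alt_sample, ct_wt_sample, ct_alt_control, ct_wt_control):
    """Relative abundance of the alternative isoform by the 2^(-ddCt) method.

    dCt is alternative-isoform Ct minus wild-type-isoform Ct within a sample;
    ddCt is the sample dCt minus the control dCt. Assumes each PCR cycle
    doubles the product.
    """
    delta_sample = ct_alt_sample - ct_wt_sample
    delta_control = ct_alt_control - ct_wt_control
    delta_delta = delta_sample - delta_control
    return 2.0 ** (-delta_delta)

# Hypothetical Ct values for illustration only.
print(fold_change(24, 22, 29, 22))
```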


Generation of a lentiviral vector for the expression of U1 snRNA
r.3A>G 

The pLKO.1-puro U6 sgRNA BfuAI stuffer lentiviral vector (Addgene no.
50920) was modified by removing the internal U6 promoter (between
NdeI and EcoRI), and it was replaced by the U1 snRNA locus, including
393 bases of internal native U1 promoter, the U1 sequence, and 39 bases
of 3’-flanking region using the following oligonucleotides
(5’-GTCGAGAATTCTTGGCGTACAGTCTGTTTTT1G and
5’-CTATCATATGTAAGGACCAGCTTCTTTGGGA). The PCR products were digested
with NdeI and EcoRI, and cloned in the modified pLKO.1 plasmid. The r.3A>G mutation
was introduced by site-directed mutagenesis. All plasmids were verified 
by Sanger sequencing. 


Exogenous expression of U1 snRNA r.3A>G mutation 

Cell lines used were human embryonic kidney 293T (HEK-293T) from 
American Type Culture Collection (ATCC) (CRL-3216). Cell stocks were 
mycoplasma-tested before U1 vector transfection. HEK-293T cells were 
grown in DMEM, 10% FBS and 1% PSG. For exogenous expression of U1 
snRNA, HEK-293T cells (5 x 10° cells) were cultured in 10-cm plates and 
transfected using Lipofectamine Plus (Invitrogen) with 2 µg of either
pLKO.1-U1wt (containing the wild-type U1 snRNA locus) or
pLKO.1-U1r.3A>G (containing the r.3A>G mutation) in duplicate. Twelve hours
after transfection, the medium was replaced with complete medium, 
and 48 h later the total RNA was extracted with the Trizol method. 


Verification of the expression of U1 snRNA r.3A>G mutation

Rapid amplification of cDNA ends (RACE) was performed using 1 µg of
total RNA from HEK-293T cells transfected with either pLKO.1-U1wt or 
pLKO.1-U1r.3A>G following the recommendations of the manufacturer 
(Sigma-Aldrich 3353621001), and the following specific oligonucleotides 
(U1-RACE-SP1: 5’-CAGGGGAAAGCGCGAACGCAGT and U1-RACE-SP2:
5’-CCCACTACCACAAATTATGC). A single amplification band of the
expected size (160 bp) was excised from the gel, purified and sequenced 
with the internal oligonucleotide U1-RACE-SP2. 


Sequence analyses of exogenous expression analysis 

Messenger RNA library construction was performed based on oligo dT- 
based mRNA isolation using NEBNext Poly(A) mRNA Magnetic Isolation 
Module. RNA sequencing was performed on NextSeq 550 using 100-bp 
paired-end mode. Mapping and intron clustering were performed 
with the methods described in ‘RNA-seq’ and ‘Intron-centric alternative
splicing analysis’. LeafCutter was run with the option ‘-g 0 -i 2’
and the obtained results were filtered by a q value of each cluster <
0.1, in which at least 1 absolute effect size calculated by LeafCutter
was more than 1.5.


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Sequencing data have been deposited in the European Genome-Phenome
Archive (EGA) and Gene Expression Omnibus (GEO): RNA-seq
(EGAD00001001899 and EGAD00001004958), whole-genome
sequences (EGAD00001003125 and EGAD00001004347) and RNA-seq
of exogenous expression analyses (GSE128005). Materials used in this
study are available from the following: GENCODE
(ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_19/gencode.v19.annotation.gtf.gz),
ICGC (https://icgc.org/), hs37d5 reference
(ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/phase2_reference_assembly_sequence),
Burrows-Wheeler aligner (bwa) (http://bio-bwa.sourceforge.net/),
Mutect2 (https://software.broadinstitute.org/gatk/),
EBCall (https://github.com/friend1ws/EBCall),
Varscan2 (http://dkoboldt.github.io/varscan/),
Strelka (https://github.com/Illumina/strelka),
SomaticSniper (http://gmt.genome.wustl.edu/packages/somatic-sniper/),
Virmid (https://sourceforge.net/p/virmid/wiki/Home/),
Platypus (http://www.well.ox.ac.uk/platypus),
Seurat (https://sites.google.com/site/seuratsomatic/home),
ENCODE (https://www.encodeproject.org/),
PennCNV (http://penncnv.openbioinformatics.org/en/latest/),
Database of Genomic Variants (http://dgv.tcag.ca/dgv/app/home),
Genomon Project (https://github.com/Genomon-Project),
SpliceRack (http://katahdin.mssm.edu/splice/index.cgi?database=spliceNew)
and GEO (https://www.ncbi.nlm.nih.gov/geo/).
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for alternative splicing analysis. b, Cohort specification. c, Subgroup distribution of whole-genome sequencing (WGS) cohorts. 
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Extended Data Fig. 2|See next page for caption. 
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Extended Data Fig. 2| U11snRNA mutations, and conservation of U1 and U11 
snRNA genesacross evolution. a, Seed sequences of the U1 snRNA obtained 
from the Rfam database demonstrate high-level conservation across a variety of 
eukaryotic species, particularly at the site of the SHH medulloblastoma 
mutation. The consensus sequence and first 50 nucleotides of reference 
sequences are included for comparison. Grey indicates nucleotide differences, 
and red identifies the SHH medulloblastoma hotspot mutation. b, Cartoon 
illustrating the number of somatic mutations in the U11 snRNA genes. Sequence 
conservation scores for U11snRNA, as determined using the Rfam database. c, 


Secondary structure of the mutant U11 snRNA. The red circle identifies the 
location of the hotspot mutation. The yellow and green rectangles indicate the 
5’ splice-site recognition site and the Sm protein-binding site, respectively. 
Numerals ItoIV indicate stem loops. d, Seed sequences of the U1ll snRNA 
obtained from the Rfam database demonstrate high-level conservation 

across a variety of eukaryotic species, particularly at the site of the SHH 
medulloblastoma mutation. The consensus sequence and first 30 nucleotides of 
reference sequences are included for comparison. Grey indicates nucleotide 
differences, and red identifies the SHH medulloblastoma hotspot mutation. 
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Extended Data Fig. 3 | High levels of genomic conservation surrounding 
human U1 snRNAs complicate the specific PCR amplification of any 
individual locus. a, Genomic locations of the 4 expressed U1 snRNA genes (on 
chromosome 1p, red) and 136 pseudogenes across the Homo sapiens genome, 
as indicated. Three pseudogenes with sequences that are identical (hg19) to the 
expressed U1 snRNA genes are indicated in orange. b, Average mapping quality 
of bwa-mem and coverage of each of the expressed U1 and U11 snRNA genes 
from whole-genome sequencing of germline samples from patients with 
medulloblastoma are illustrated (n = 341 patients). Blue bars represent the 
alignability of 100-mers using the GEnome Multitool (GEM) mapper from 
ENCODE and Centre for Genomic Regulation (CRG). Regions that are >1,000 
bases upstream and downstream are shown on a log10 scale. Red bar indicates 
the gene body. c, Average number of multimapped reads overlapped for each 
gene pair using STAR aligner. The heat map shows the average number of 
mapped reads across the whole-genome sequencing of germline samples from 
patients with medulloblastoma (n = 341 patients). d, Sequence similarity of U1 
snRNA genes, U1 snRNA pseudogenes with 164 identical base pairs and the U11 
snRNA gene. The numbers in each square and heat map indicate identity scores 
and bit scores calculated using blast software. A blank square indicates that no 
hit was found. 

Extended Data Fig. 4 | Allele-specific rhAmp SNP PCR of RNU1 loci, copy- 
number changes in SHH medulloblastoma with mutant U1 snRNA versus 
wild-type U1 snRNA, and prognostic analysis. a, The frequency of any U1 
snRNA mutation in the RNU1_batch primer set (RNU1-1, RNU1-2, RNU1-3, RNU1-4 
and RNVU1-18) (left) and the RNU1_pseudo primer set (RNU1-27P and RNU1-28P) 
(right). b, Hotspot mutations of RNU1-27P or RNU1-28P U1 snRNA pseudogenes, 
as confirmed by Sanger sequencing. c, Broad copy-number aberrations in SHHα 
with wild-type U1 snRNA (n = 25), SHHα with mutant U1 snRNA (n = 8) and SHHδ 
with mutant U1 snRNA (n = 41). Dark blue and dark red bars, as well as asterisks, 
identify statistically significant regions, comparing SHHα with mutant versus 
wild-type U1 snRNA (P < 0.05, two-sided Fisher’s exact test). d, Significant focal 
copy-number aberrations in SHHα with wild-type U1 snRNA (n = 25), SHHα with 
mutant U1 snRNA (n = 8) and SHHδ with mutant U1 snRNA (n = 41) illustrate 
significant genomic differences between cases with wild-type and mutant U1 
snRNA. Candidate target genes within the corresponding loci are indicated. 
q values were calculated using GISTIC (Methods). e-g, Overall survival of 
patients with SHHα, stratified by mutational status of U1 snRNA 
(n = 10 for mutant, n = 27 for wild type) (e), TP53 (n = 15 for mutant, n = 22 for wild 
type) (f) or both (n = 9 for both mutant, n = 1 for U1 snRNA mutation only, n = 6 for 
TP53 mutation only, n = 21 for both wild type) (g). P values were determined using 
the two-sided log-rank test. h, i, Progression-free survival (h) and overall survival 
(i) stratified by U1 snRNA mutation and SHH subtype (n = 10 for SHHα with 
mutant U1 snRNA, n = 27 for SHHα with wild-type U1 snRNA, n = 23 for SHHβ with 
wild-type U1 snRNA, n = 24 for SHHγ with wild-type U1 snRNA, n = 46 for SHHδ 
with mutant U1 snRNA). P values were determined using the two-sided log-rank 
test. +, censored case. 

a, Quantification of alternative splicing events by SHH subtype, as detected by 
intron-centric alternative splicing analysis (n = 30 of each subtype). Bar plot 
shows adjusted standardized residual of included alternative splicing events. 
Positive values indicate a relatively higher number, and negative values indicate 
a relatively lower number among subtypes. b, Volcano plots of alternative 

absolute log effect size > 1.5 calculated using LeafCutter (Methods)) are 
illustrated by colour. Alternative splicing events of PTCH1 and GLI2 with the 
highest effect size are annotated. c, Splice-site sequences of included alternative 
splicing events by subtype (n = 30 of each subtype). Asterisk denotes nucleotide 
sites with q value < 107 (χ² test and Benjamini-Hochberg method). 

Extended Data Fig. 6 | Intron-centric analysis of SHHα medulloblastomas. 
a, b, Quantification (a) and proportion (b) of alternative splicing events between 
SHHα medulloblastoma with mutant U1 snRNA (n = 13), and SHHα 
medulloblastoma with wild-type U1 snRNA (n = 39) as detected by intron-centric 
alternative splicing analysis. P value was calculated by χ² test. c, Volcano plots of 
alternative splicing events (n = 13 for SHHα with mutant U1 snRNA, n = 39 for 
SHHα with wild-type U1 snRNA). The x axis shows the difference of per cent 
spliced in calculated using LeafCutter. Significant events (FDR < 0.01 and 
absolute log effect size > 1.5, calculated by LeafCutter (Methods)) are illustrated 
by colour. d, Splice-site sequences of included alternative splicing events in 
SHHα with mutant U1 snRNA (n = 13 for SHHα with mutant U1 snRNA, n = 39 for 
SHHα with wild-type U1 snRNA). Size and colour for each circle indicate the 
q values and Cramér’s V values for each nucleotide position (q values were 
calculated by χ² test and Benjamini-Hochberg method; the precise values are 
given in Supplementary Table 11). e, Residual analysis of 5’ splice-site sequences 
of annotated and cryptic 5’ alternative splicing (n = 13 for SHHα with mutant U1 
snRNA, n = 39 for SHHα with wild-type U1 snRNA). The size and colour of each 
circle denote the two-sided P value, and adjusted standardized residual 
calculated by Haberman’s method. The precise values are given in 
Supplementary Table 11. 

Extended Data Fig. 7 | Nonsense-mediated decay pathway in SHH 
medulloblastoma with mutant U1 snRNA, and exogenous expression 
analyses. a, Enrichment plots of ‘GO nuclear transcribed mRNA catabolic 
process nonsense mediated decay’ by GSEA between SHHδ with mutant U1 
snRNA (n = 30) and other subtypes of SHH medulloblastoma with wild-type 
U1 snRNA (n = 90), and SHHα with mutant U1 snRNA (n = 13) and SHHα with 
wild-type U1 snRNA (n = 39). P values were calculated using gsea v.3.0 (Methods). 
b, Quantification of alternative splicing events between HEK-293T cells with 
mutant U1 snRNA and HEK-293T cells with wild-type U1 snRNA, as detected by 
intron-centric alternative splicing analysis. c, Splice-site sequences of included 
alternative splicing events in HEK-293T cells with mutant U1 snRNA. Asterisk 
denotes nucleotide sites with q value < 10 (χ² and Benjamini-Hochberg 
method). d, Comparison of the extent of overlap between detected alternative 
splicing events by SHH medulloblastoma (either of SHHα or of SHHδ) with 
mutant U1 snRNA, SHH medulloblastoma (either of SHHα, SHHβ or SHHγ) with 
wild-type U1 snRNA and HEK-293T cells with mutant U1 snRNA exogenous 
expression. Left, alternatively spliced events with cryptic 5’ sites. Right, 
alternatively spliced events with cryptic 5’ sites and a C base at the 6th intronic 
position. 
e, Alternative splicing signatures by t-SNE analysis. Left, the per cent spliced-in 
values of detected cryptic 5’ alternative splicing events, with a ‘C’ nucleotide at 
the 6th base in the intron from the 5’ splice site. Top right, per cent spliced-in 
values of all cryptic 5’ alternative-splicing events. Bottom right, per cent spliced-in 
values of all alternative splicing events. 

Extended Data Fig. 8 | Retained introns inactivate tumour-suppressor genes 
in tumours with U1 snRNA r.3A>G mutation. a, Illustration of the different 
types of alternative splicing events analysed using rMATS (n = 30 of each 
subtype). Red arrows indicate expected 5’ splice sites recognized by the mutant 
U1 snRNA. b, Quantification of alternative splicing events by subtype of SHH 
medulloblastoma, as detected by exon-centric alternative splicing analysis. 
c, Scatter plots of alternative splicing events (n = 30 of each subtype). The x axis 
shows the difference of per cent spliced in (psi) calculated using rMATS. 
Different types of significant event (FDR < 0.01 and absolute differential psi > 
0.05 calculated using rMATS (Methods)) are illustrated by different colours, as 
annotated. d, Splice-site sequences of alternative 5’ splice site (A5SS), included 
cassette exon (CE) and included retained intron (RI) events in SHHδ with mutant 
U1 snRNA (n = 30). Each event corresponds to a red arrow cartoon in a. Asterisk 
denotes nucleotide sites with q value < 10? (χ² test and Benjamini-Hochberg 
method). e, Distribution of per cent spliced in for PAX6 based on U1 snRNA 
mutation status (n = 13 for SHHα with mutant U1 snRNA, n = 30 for SHHδ with 
mutant U1 snRNA, n = 99 for SHH medulloblastoma with wild-type U1 snRNA 
(n = 90) and normal brain tissue (n = 9)). Dashed line defines threshold that 
divides the dataset into two groups (k-means method). The table displays the 
number of samples above the threshold (high) or below (low) based on 
mutational status. P value was calculated using two-sided Fisher’s exact test 
compared to samples with wild-type U1 snRNA. Samples with mutant U1 snRNA 
are indicated in pink, and wild-type samples in blue. f, Sashimi plot of splicing of 
PAX6 based on mutational status determined by exon-centric alternative 
splicing analysis (rMATS). The bar plot shows modified FPKM. Numbers refer to 
average junctional reads across all samples. Annotated exon tracks are shown 
below, with genomic positions marked. g, Distribution of per cent spliced in for 
TOX4 based on U1 snRNA mutation status (n = 13 for SHHα with mutant U1 
snRNA, n = 30 for SHHδ with mutant U1 snRNA, n = 99 for SHH medulloblastoma 
with wild-type U1 snRNA (n = 90) and normal brain tissue (n = 9)). Dashed line 
defines threshold that divides the dataset into two groups (k-means method). 
The table displays the number of samples above the threshold (high) or below 
(low) based on mutational status. P value was calculated using two-sided Fisher’s 
exact test compared to samples with wild-type U1 snRNA. Samples with mutant 
U1 snRNA are indicated in pink, and wild-type samples in blue. h, Sashimi plot of 
splicing of TOX4 based on mutational status determined by exon-centric 
alternative splicing analysis (rMATS). The bar plot shows modified FPKM. 
Numbers refer to average junctional reads across all samples. Annotated exon 
tracks are shown below, with genomic positions marked. 

Extended Data Fig. 9 | Exon-centric analysis of SHHα medulloblastomas and 
overlapped splicing events. a, Quantification of alternative splicing events 
between SHHα medulloblastoma with mutant U1 snRNA and SHHα 
medulloblastoma with wild-type U1 snRNA, as detected by exon-centric 
alternative splicing analysis. b, Scatter plots of alternative splicing events (n = 13 
for SHHα with mutant U1 snRNA, n = 39 for SHHα with wild-type U1 snRNA). The x 
axis shows the difference of per cent spliced in calculated using rMATS. Different 
types of significant event (FDR < 0.01 and absolute differential psi > 0.05 
calculated using rMATS (Methods)) are illustrated by different colours, as 
annotated. c, Splice-site sequences of alternative 5’ splice sites, included 
cassette exon and included retained intron events in SHHα medulloblastoma 
with mutant U1 snRNA and SHHα medulloblastoma with wild-type U1 snRNA. 
Each event corresponds to a red arrow cartoon in Extended Data Fig. 8a. Asterisk 
denotes nucleotide sites with q value < 10? (χ² test and Benjamini-Hochberg 
method). d, Box plot of fold changes in expression of the alternatively 
spliced isoform as compared to the wild-type isoform in subsets of SHH 
medulloblastoma, as determined by real-time qPCR. In the box plots, the centre 
lines show data median; box limits indicate the IQR from the 25th and 75th 
percentiles; lower and upper whiskers extend 1.5x the IQR. Outliers are 
represented by individual points. P values were calculated using two-sided 
Wilcoxon rank-sum test. e, Comparison of the extent of overlap between 
splicing events by subtype of SHH medulloblastoma and U1 snRNA mutational 
status. Effect sizes are calculated by LeafCutter with an absolute effect-size 
threshold of 1.5. 

Extended Data Fig. 10 | Aberrant splicing of oncogenes and tumour- 
suppressor genes in SHH medulloblastoma with mutant U1 snRNA. 
a, Overview of cryptic alternative splicing of GLI2, demonstrating the position of 
a cryptic cassette exon with the 5’ splice-site sequence. b, Sashimi plot of 
splicing of GLI2 in representative cases. The bar plot shows counts per million 
reads. Numbers are of junctional reads; reads for the mutant U1 snRNA isoform 
are in red. c, Scatter plot comparing detected alternatively spliced read and total 
junction reads that share a 3’ splice site. Jittering was performed for both values. 
d, Per cent spliced in values for SHHα with mutant U1 snRNA, SHHδ with mutant 
U1 snRNA and SHH medulloblastoma with wild-type U1 snRNA (all subtypes of 
SHH medulloblastoma). e, Box plot of fold changes in expression of the 
alternatively spliced isoform as compared to the wild-type isoform of GLI2 in 
subsets of SHH medulloblastoma, determined by real-time qPCR. f, Illustration 
of canonical and cryptic isoforms of GLI2. Translation start sites are indicated by 
an ATG arrow. Resulting proteins (and sizes) are displayed for each isoform. 
Repression and activation domains are indicated in blue and orange, 
respectively. g, Overview of cryptic alternative splicing of CCND2, illustrating 
the position of a cryptic cassette exon with the 5’ splice-site sequence. h, Sashimi 
plot of representative cases demonstrates alternative splicing at the CCND2 
locus. Numbers illustrate junctional reads. Junctional reads specific to U1 snRNA 
mutants are in red. i, The canonical isoform and the cryptic isoform of CCND2. 
j, Scatter plot comparing detected alternatively spliced read and total junction 
reads that shared a 3’ splice site. Jittering was performed for both values. k, Per 
cent spliced in values for U1-mutant SHHα (n = 13), U1-mutant SHHδ (n = 58), and 
U1-wildtype SHH (all SHH subtypes, n = 104). l, Real-time qPCR comparing the 
expression of the cryptic isoform of CCND2 demonstrates high levels of 
expression of CCND2 restricted to SHHδ cases (n = 6 for SHHα with mutant U1 
snRNA, n = 6 for SHHδ with mutant U1 snRNA, n = 6 for SHHα with wild-type U1 
snRNA). m, Overview of cryptic alternative splicing of PAX5, illustrating the 
position of a cryptic cassette exon with the 5’ splice-site sequence. n, Sashimi 
plot of representative cases demonstrates alternative splicing at the PAX5 locus. 
Numbers denote junctional reads. Junctional reads specific to U1 snRNA 
mutants are in red. o, The canonical isoform and the cryptic isoform of PAX5. 
p, Scatter plot comparing detected alternatively spliced read and total junction 
reads that shared a 3’ splice site. Jittering was performed for both values. q, Per 
cent spliced in values by SHHα with mutant U1 snRNA (n = 5), SHHδ with mutant 
U1 snRNA (n = 27) and SHH medulloblastoma with wild-type U1 snRNA (all 
subtypes of SHH medulloblastoma, n = 7). In all box plots, centre lines show data 
median; box limits indicate the IQR from the 25th and 75th percentiles; lower and 
upper whiskers extend 1.5x the IQR. Outliers are represented by individual 
points. P values were calculated using two-sided Wilcoxon rank-sum tests. 

our mutation call pipeline described in Methods section. 

Replication: rhAmp SNP genotyping was done three times. All attempts at replication were successful. All qPCR was done in technical triplicate aside from 

Cancers are caused by genomic alterations known as drivers. Hundreds of drivers in 
coding genes are known but, to date, only a handful of noncoding drivers have been 
discovered, despite intensive searching. Attention has recently shifted to the role 
of altered RNA splicing in cancer; driver mutations that lead to transcriptome-wide 
aberrant splicing have been identified in multiple types of cancer, although these 
mutations have only been found in protein-coding splicing factors such as splicing 
factor 3b subunit 1 (SF3B1). By contrast, cancer-related alterations in the noncoding 
component of the spliceosome (a series of small nuclear RNAs (snRNAs)) have barely 
been studied, owing to the combined challenges of characterizing noncoding cancer 
drivers and the repetitive nature of snRNA genes. Here we report a highly recurrent 
A>C somatic mutation at the third base of U1 snRNA in several types of tumour. The 
primary function of U1 snRNA is to recognize the 5’ splice site via base-pairing. This 
mutation changes the preferential A-U base-pairing between U1 snRNA and the 
5’ splice site to C-G base-pairing, and thus creates novel splice junctions and alters the 
splicing pattern of multiple genes, including known drivers of cancer. Clinically, the 
A>C mutation is associated with heavy alcohol use in patients with hepatocellular 
carcinoma, and with the aggressive subtype of chronic lymphocytic leukaemia with 
unmutated immunoglobulin heavy-chain variable regions. The mutation in U1 snRNA 
also independently confers an adverse prognosis to patients with chronic lymphocytic 
leukaemia. Our study demonstrates a noncoding driver in spliceosomal RNAs, reveals a 
mechanism of aberrant splicing in cancer and may represent a new target for 
treatment. Our findings also suggest that driver discovery should be extended to a 
wider range of genomic regions. 


To determine the extent of U1 snRNA (hereafter, U1) mutations in cancer, 
we first screened 2,583 whole-genome sequenced donors across 
37 tumour types from the ‘Pan-Cancer Analysis of Whole Genomes’ 
(PCAWG) project (Supplementary Table 1). The human genome 
(GRCh37) has 7 genes with the same canonical 164-bp U1 sequence, 
and more than 130 pseudogenes with variant U1 sequences; the flanking 
sequences of the genes and pseudogenes are also highly similar 
(Supplementary Note). We therefore called somatic mutations across canonical 
U1 genes by using reads mapped only to them (Extended Data Fig. 1a), 
and reported all possible mutated genes for each U1 mutation. Within 
the 2,434 donors who had sufficient coverage, we identified 277 somatic 
mutations in U1 genes that affected 240 donors, across 30 tumour types 
(Fig. 1a, Supplementary Table 2). These mutations spanned 100 of the 
164 bases of U1, but only 2 positions (base 3 and base 28) were mutated 
in more than 5% of donors in at least 1 tumour type. 

Base 28 of U1 falls in a stem loop, and was recurrently mutated in 4 
out of 23 (17.4%) bladder cancers. This may be related to the previously 
reported higher mutation rate among palindrome loops in bladder 
cancer. The third base of U1 contained 27 A>G and 21 A>C mutations 
across five types of tumour. This base forms part of the highly conserved 
5’ splice-site recognition sequence (nucleotides 3-10) of U1, which 
base-pairs directly with the 5’ splice site. Five additional A>C mutations were 
recovered from samples with insufficient coverage (Supplementary 
Note). Collectively (Fig. 1b), the A>G mutation was found in 26 out of 
135 (19.3%) cases of medulloblastoma (which are described in detail in 
the accompanying paper) and 1 out of 230 (0.4%) cases of pancreatic 
adenocarcinomas. The A>C mutation (hereafter g.3A>C) was found in 
8 out of 78 (10.3%) cases of chronic lymphocytic leukaemia (CLL), 16 
out of 189 (8.5%) cases of hepatocellular carcinoma (HCC) and 2 out of 
107 (1.9%) cases of B cell non-Hodgkin lymphomas. Mutations were also 
found in other bases of the 5’ splice-site recognition sequence, but at 
a much lower frequency (Fig. 1c); a preponderance of non-third-base 
mutations (9 out of 20) was observed in B cell non-Hodgkin lymphomas. 

Fig. 1 | Overview of somatic mutations in U1. a, Distribution of mutations in 
canonical U1 genes. The red shaded region indicates the 5’ splice-site 
recognition sequence. b, Recurrent mutations at the third base of U1 across five 
tumour types. Adeno CA, adenocarcinoma; CNS, central nervous system; BNHL, 
B cell non-Hodgkin lymphoma; medullo, medulloblastoma; panc, pancreas. c, 
Distribution of mutations within the 5’ splice-site recognition sequence by 

To enlarge our sample size, we used widespread alterations in splicing 
and expression patterns of samples with U1 mutations to infer the 
mutational status of the third base in tumours that were associated 
with RNA-sequencing (RNA-seq) data only (Extended Data Fig. 1b, c 
and 2f). Using machine learning (Methods, Supplementary Note), we 
predicted that an additional 4 out of 240 (1.7%) cases of CLL and 14 out 
of 321 (4.4%) cases of HCC contain the g.3A>C mutation. To benchmark 
the classifier, we validated the g.3A>C status for 298 cases of CLL using a 
PCR-based SNP genotyping system (rhAmp) (Methods). Only one sample 
showed an inconsistent genotype, which was treated as mutated in 
subsequent analysis (Supplementary Note). By combining the results based 
on whole-genome sequencing and analyses of the transcriptome, the 
recurrent g.3A>C mutation was found in 12 out of 318 (3.8%) donors with 
CLL and 30 out of 510 (5.9%) donors with HCC (Extended Data Fig. 1d, e, 
Supplementary Table 3). 
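
The combined frequencies follow directly from adding the whole-genome and RNA-seq-only counts reported above. A short check of that arithmetic, in which the variable and function names are illustrative only:

```python
def percent(mutated, total):
    """Percentage of mutated donors, rounded to one decimal place."""
    return round(100 * mutated / total, 1)


# (mutated, total) donors, taken from the counts stated in the text.
wgs = {"CLL": (8, 78), "HCC": (16, 189)}        # whole-genome sequencing
rna_only = {"CLL": (4, 240), "HCC": (14, 321)}  # predicted from RNA-seq only

combined = {}
for tumour in wgs:
    mutated = wgs[tumour][0] + rna_only[tumour][0]
    total = wgs[tumour][1] + rna_only[tumour][1]
    combined[tumour] = (mutated, total, percent(mutated, total))

for tumour, (mutated, total, pct) in combined.items():
    print(f"{tumour}: {mutated} out of {total} ({pct}%)")
```

For CLL this gives 8 + 4 = 12 mutated donors out of 78 + 240 = 318, and for HCC 16 + 14 = 30 out of 189 + 321 = 510, matching the figures in the text.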

The splice-site consensus motif prefers a U at the sixth position of 
the 5’ splice site; we hypothesized that the g.3A>C mutation could shift 
this preference towards a G (which we hereafter term the G6 5’ splice 
site) (Fig. 1d). Using RNA-seq data from matched cases of CLL (11 with 
U1 mutation versus 254 with wild-type U1) and HCC (20 with U1 mutation 
versus 367 with wild-type U1), we performed differential splicing 
analysis using annotation-free and intron-centric software (LeafCutter) 
(Methods). We identified 3,193 and 533 differentially spliced introns 
in 1,519 and 303 genes (LeafCutter q < 0.1 and absolute log2 (effective 
size) > 1) in CLL and HCC, respectively (Fig. 2a, Supplementary Table 4). 
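
The hypothesis rests on simple Watson-Crick pairing between base 3 of U1 and the sixth intronic base of the 5’ splice site. The sketch below illustrates that logic only. It assumes that the first ten bases of U1 read AUACUUACCU (an assumption of this example, not taken from the text above), it ignores G-U wobble pairing, and all function names are hypothetical.

```python
# Canonical Watson-Crick RNA base pairs (G-U wobble deliberately ignored).
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}

# Assumed first 10 bases of wild-type U1 snRNA, written 5' to 3'.
U1_WT = "AUACUUACCU"


def mutate(seq, position, base):
    """Return seq with the 1-based position replaced by base."""
    return seq[: position - 1] + base + seq[position:]


def pairs_with_sixth_base(u1_seq, sixth_base):
    """True if base 3 of U1 can pair with the sixth intronic base."""
    return (u1_seq[2], sixth_base) in WATSON_CRICK


U1_MUT = mutate(U1_WT, 3, "C")  # the g.3A>C mutation

for name, u1 in (("wild type", U1_WT), ("g.3A>C", U1_MUT)):
    for sixth in "UG":
        print(name, sixth, pairs_with_sixth_base(u1, sixth))
```

Under these assumptions the wild-type A at base 3 pairs with a U at the sixth position, whereas the mutant C pairs with a G, which is the shift in preference proposed in the text.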

For each intron, we further determined its direction of change using 
the change in per cent spliced in (ΔPSI) (Extended Data Fig. 2b). When 
comparing the base composition of the 5’ splice site among introns 
with increased excision (ΔPSI > 0) and decreased excision (ΔPSI < 0) in 
samples with U1 mutation, we observed significant differences at the 
sixth position for both CLL (χ² test, P = 3.3 × 10) (Fig. 2b) and HCC (χ² 
test, P = 6.7 × 10°°) (Extended Data Fig. 2c). Consistent with expectations, 
introns with increased excision were highly enriched in the G6 5’ splice 
site relative to introns with decreased excision (38.4% versus 16.4% in 
CLL; 47.0% versus 22.1% for HCC) (Fig. 2b), or to genome-wide canonical 
introns (18.9%) (Fig. 1d). In consequence, we observed many novel 
splicing events in samples with U1 mutation, especially for splicing with the 
cryptic G6 5’ splice site in both types of tumour (Fig. 2c, Extended Data 
Fig. 2d). Together, these data support the hypothesis that the g.3A>C 
mutation increases the splicing rate of the G6 5’ splice site. 

tumour type. d, The RNA-RNA interaction between U1 and the 5’ splice site. 
Bases 3 to 10 of U1 (red box and numbering) can base-pair with the 5’ splice site 
(black box and numbering). The base-pairing affected by the g.3A>C mutation is 
in blue. PhyloP scores show the conservation levels of human canonical 5’ splice 
site (n = 344,580 introns). Centre line, median; box limits, upper and lower 
quartiles; whiskers, 1.5x interquartile range; points, outliers. 
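
To make the enrichment concrete, the sketch below computes a Pearson χ² statistic for a contingency table. It is not the test reported above, which compared the full base composition at the sixth position. The 2 × 2 counts are approximations reconstructed here from the reported CLL percentages (38.4% and 16.4%) and the intron numbers given in the Fig. 2b caption (1,657 and 1,536), so they are illustrative rather than the authors' data.

```python
def chi_square(table):
    """Pearson chi-squared statistic and degrees of freedom for an
    r x c contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof


# Approximate counts (assumption): rows are introns with increased and
# decreased excision; columns are G6 and non-G6 5' splice sites.
cll_table = [
    [636, 1657 - 636],  # about 38.4% of 1,657 introns
    [252, 1536 - 252],  # about 16.4% of 1,536 introns
]
statistic, dof = chi_square(cll_table)
print(f"chi-squared = {statistic:.1f} with {dof} degree(s) of freedom")
```

A statistic this large on one degree of freedom corresponds to a very small P value; converting it to an exact P value would need a χ² distribution function, for example from a statistics library.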

Because splicing and expression frequently correlate, we also 
conducted differential expression analysis for the g.3A>C mutation 
(Extended Data Fig. 2e). This analysis revealed 869 and 68 differentially 
expressed genes (q < 0.1 and absolute log2-transformed fold change > 1) 
(Supplementary Table 4) for CLL and HCC, respectively. More genes were 
upregulated than downregulated in the samples with U1 mutations: 561 
out of 869 and 66 out of 68 in cases of CLL and HCC, respectively (Fig. 2d). 

We next investigated genes affected by the U1 mutation. We found 
that 84 and 16 genes in the Cancer Gene Census (v.84) were mis-spliced 
in cases of CLL and HCC, respectively. Among these genes, 44 and 
10, respectively, had increased excision of G6 5’ splice site introns, 
including known drivers of CLL and HCC (Supplementary Table 4). 
The most significant mis-spliced cancer gene in CLL was musashi RNA 
binding protein 2 (MSI2) (LeafCutter q = 1.2 × 10°); CLL with U1 mutations 
exclusively expressed a cryptic exon that contains a premature 
termination codon, and was associated with a G6 5’ splice site (Fig. 3a, 
Extended Data Fig. 3a). A similar pattern was observed for the gene DNA 
polymerase delta 1, catalytic subunit (POLD1) (q = 4.2 × 10°) (Fig. 3b, 
Extended Data Fig. 3a). As the cryptic exon affected the polymerase (but 
not the exonuclease) domain of POLD1, the g.3A>C mutation was not 
associated with a higher mutation burden. 

We also found mis-splicing in other genes related to CLL biology, 
suchas the hyaluronic acid receptor gene CD44 molecule (Indian blood 
group) (CD44). CD44 was the most significantly differentially spliced 
gene (LeafCutter g=5.1 10"). Alternative splicing of CD44 is tissue- 
specific and has previously been associated with processes such as 
lymphocyte homing and tumorigenesis; the gene is also thought to 
regulate anti-apoptosis signalling in CLL”. Patients with wild-type CLL 
expressed predominantly the standard isoform (CD44s, which does 
not contain exon v2-v10) (Fig. 3c), whereas cases of CLL with U1 muta- 
tions overexpressed multiple variant isoforms (CD44v)—presumably 
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Fig. 2| Global gene splicing and expression changes associated with the 
g.3A>C mutation. a, Pvalue quantile-quantile plots for differential splicing 
analysis. Pvalues are from LeafCutter. b, 5’ splice site (SS) for introns with 
increased (n=1,657 introns) or decreased excision (n=1,536 introns) in cases of 
CLL with U1 mutation (n=11 patients). The top bar chart shows q values from x” 
tests for base composition difference; the red line indicates q=0.1.c, Category 
of mis-splicing events in CLL. Extended Data Figure 2a provides the definitions 


because the presence of several G6 5’ splice sites increased the excision 
rate of introns associated with variant exons (Fig. 3d). Another, similar 
example is ATP-binding cassette sub-family D member 3 (ABCD3), afatty 
acid transporter for peroxisomes; two cryptic exons were expressed 
exclusively in cases of CLL with U1 mutations (Extended Data Fig. 3a-c). 
The consistent combination of frequent mis-splicing (¢=7.1 10) and 
overexpression (log,-transformed fold change = 2.3 and q=3.7 x10°°°) 
in ABCD3 enabled us to create a single-gene score that predicted g.3A>C 
mutational status with 100% accuracy in CLL (Extended Data Fig. 3d, 
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Fig. 3 | Cancer-related genes that are mis-spliced in CLL with U1 mutations. 
a-c, Sashimi plots showing mis-splicing patterns of MS/2 (a), POLD1 (b) and CD44 
(c). MUT (n=11) and WT (wild type) (n=254) represent samples of CLL with and 
without g.3A>C mutations, respectively. For each genotype (MUTinred; WT in 
blue), the three tracks from top to bottom show splice junctions, average 
expression levels in counts per million (CPM) and gene models, respectively. 
Each splice junction is shownasa curve and weighted by PSI values from 
LeafCutter. In gene models, black boxes are annotated exons, and red boxes 
indicate the cryptic (a, b) or variant (c) exons. The introns for MS/2 are 
downscaled for better visualization (dashed lines). Gene models for MS/2 
(ENST00000284073.2) and POLDI (ENSTO00000599857.1) are based on primary 
transcripts; the gene model of CD44 is shown asa cartoon. d, Isoform expression 
of CD44. Pvalues are from two-sided Wilcoxon rank-sum tests. MUT (n= 6) and 
WT (n=61) represents biologically independent CLL samples with and without 
g.3A>C mutations, respectively. For the box plot, centre line, box limits, 
whiskers and points represent the median, 25th and 75th percentiles, 1.5x 
interquartile range and individual samples, respectively. 
of each category. The number of introns is coloured by the sixth base of 5’ splice 
site. d, Volcano plots for differential expression analysis. P and q values are from 
limma. NS, not significant (q > 0.1 or log2-transformed fold change < 1). For a, d, 
biologically independent patient samples are used for CLL (11 with U1 mutation 
versus 254 with wild-type U1) and HCC (20 with U1 mutation versus 367 with 
wild-type U1). 


e). We experimentally validated the differentially spliced junctions of 
MSI2, POLD1, CD44 and ABCD3 using quantitative PCR (Extended Data 
Fig. 4a-c). 

Genes the aberrant splicing of which introduces a premature termina- 
tion codon are expected to be targeted by nonsense-mediated decay. 
However, mis-spliced forms of MSI2, POLD1 and ABCD3 that contained 
a premature termination codon were not downregulated in cases of 
CLL with U1 mutation, even though the distance between the premature 
termination codon and the final exon-exon junction exceeded 
the distance (55 nucleotides) needed for nonsense-mediated decay"® 
(Extended Data Fig. 3a, b). 
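
The positional rule invoked in this paragraph can be sketched in a few lines. This is a minimal illustration only: it assumes positions are given as coordinates along the spliced transcript, and the function name and arguments are hypothetical rather than taken from the study's code.

```python
NMD_MIN_DISTANCE_NT = 55  # distance threshold quoted in the text


def ptc_expected_to_trigger_nmd(ptc_end, last_junction,
                                min_distance=NMD_MIN_DISTANCE_NT):
    """Return True if a premature termination codon (PTC) lies far enough
    upstream of the final exon-exon junction to be a candidate for
    nonsense-mediated decay.

    ptc_end       -- transcript coordinate of the last base of the PTC
                     (hypothetical input convention)
    last_junction -- transcript coordinate of the final exon-exon junction
    """
    distance = last_junction - ptc_end
    return distance > min_distance
```

A transcript that satisfies this rule but is not downregulated, as reported here for MSI2, POLD1 and ABCD3, is the unexpected case the paragraph describes.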

Next, we investigated the pathway-level changes that are associated 
with the g.3A>C mutation in CLL using gene-set enrichment analysis” 
(Supplementary Table 5). In CLL with U1 mutation, we found that genes 
related to mRNA transcription, RNA splicing, protein ubiquitination 
and telomere maintenance were upregulated (Extended Data Fig. 5a, 
b), whereas genes related to apoptosis, B cell receptor signalling and 
cytoplasmic ribosomes were downregulated (Extended Data Fig. 5c-g). 
The downregulation of ribosomal genes may explain the reduced rates 
of nonsense-mediated decay’ that were noted earlier. 

To validate our findings, we introduced exogenous U1 genes with or 
without the g.3A>C mutation into three CLL cell lines (JVM3, HG3 and 
MEC1). After confirming the exogenous expression of U1 (Extended Data 
Fig. 6a), we performed the same transcriptome analysis using cell-line 
RNA-seq data. In total, 7,238 introns in 2,365 genes were differentially 
spliced, and 459 genes were differentially expressed in cell lines that 
contained U1 mutations (Supplementary Table 4). Cell lines with U1 
mutations also had many cryptic 5’ splice-site splicing events, and more 
G6 5’ splice sites in introns with increased excision than in introns with 
decreased excision (34.4% versus 15.3%; χ² test P=1.9 x 10°”) (Extended 
Data Fig. 6b, c); 39.1% of the G6 5’ splice site introns with increased exci- 
sion in patients with U1 mutations were also shared by cell lines with 
U1 mutations (Extended Data Fig. 6d). In addition, cell lines with U1 
mutations also had more genes upregulated (361 genes) than downregu- 
lated (97 genes) (Extended Data Fig. 6e), and shared many differentially 
expressed genes with primary CLL (Extended Data Fig. 6f, g). These 
data validate a causal link between the g.3A>C mutation and global 
splicing changes. 

We further studied interactions between the g.3A>C mutation and 
other drivers of CLL and HCC. The g.3A>C mutation significantly 
co-occurred with the mutation of NFKBIE in CLL (Fisher’s q = 0.0077) (Fig. 4a, 
Extended Data Fig. 7), the mutation of APOB in HCC (Mantel-Haenszel 
test q = 0.018) (Extended Data Fig. 9), and the mutation of the TERT 
Fig. 4 | Driver alterations and clinical features related to the g.3A>C 
mutation. a, Clinical features and selected driver events in CLL (n=313 
patients). The right bar chart shows the Benjamini-Hochberg adjusted P values 
from two-sided Fisher’s exact tests. Complete driver events are in Extended Data 
Fig. 7. CLL/MBL, CLL or monoclonal B cell lymphocytosis. b, Distribution of 
effect size for mis-spliced introns in CLL with U1 mutations or SF3B1 mutations. 
Dashed lines indicate the cutoff of absolute log2(effect size) = 1. c, Euler plot of 
mis-spliced intron clusters in CLL with U1 mutations or SF3B1 mutations. d, 
t-distributed stochastic neighbour embedding plot showing CLL with U1 
mutations or SF3B1 mutations can be well-separated on the basis of mis-splicing 
patterns. e, Kaplan-Meier plot for time to first treatment in CLL. Patients with U1 
promoter in one of two HCC projects (project code LIHC-US) (Fisher’s 
q= 0.016). We found that none of the samples with U1 mutations had 
SF3B1 mutations in CLL, although a larger dataset is needed for sufficient 
power to show mutual exclusion. 

Because mutated SF3B1 is also known to induce global splicing 
changes”, we compared samples with U1 mutation to samples with 
SF3B1 mutations in CLL. Consistent with previous findings, SF3B1 muta- 
tions induced many cryptic 3’ splice-site splicing events”° (Extended 
Data Fig. 8a—c). Both mutations induced numerous mis-splicing events 
with small effect sizes (Fig. 4b), but tended not to share events (Fig. 4c, 
d). Using an exon-centric method, we found that CLL with U1 mutations 
tends to induce intron retention and suppress exon skipping, whereas 
CLL with SF3B1 mutations demonstrated the opposite trend (Extended 
Data Fig. 8d, e). Notably, introns excised and exons retained in CLL with 
U1 mutations were enriched for the G6 5’ splice site (Extended Data 
Fig. 8f). 

We next investigated the clinical relevance of the g.3A>C mutation. 
CLL has two major subtypes: one subtype in which the immunoglobulin 
heavy-chain variable regions (IGHV) are mutated (M-CLL), and a more 
aggressive subtype in which the IGHV are unmutated (U-CLL)”. The 
g.3A>C mutation was frequently found in cases of U-CLL (12 out of 105, 
Relationships between major risk factors and the g.3A>C mutation in cases of 
HCC. P values are from the Cochran-Mantel-Haenszel test (project code 
LIHC-US, n=335; project code LIRI-JP, n=124; definitions of project codes are 
provided in the Methods). HBV, infection with hepatitis B virus; HCV, infection 
with hepatitis C virus. g, Box plot for age at diagnosis in cases of HCC. P values 
are from two-sided Wilcoxon rank-sum tests. The LIHC-US has 15 patients with 
the g.3A>C mutation (MUT) and 336 without (WT); the LIRI-JP has 13 MUT and 
116 WT. Centre line, median; box limits, 25th and 75th percentiles; whiskers, 1.5× 
interquartile range; jitter points, individual samples. 


11.4%) (Fisher’s exact test P = 5.6 × 10⁻⁶) (Fig. 4a) but not in M-CLL (0 out 
of 173). The mutation was also not found in monoclonal B cell lympho- 
cytosis (MBL; n =29 cases), a lesion that precedes the overt leukaemia 
phase”. Moreover, the g.3A>C mutation in CLL was significantly asso- 
ciated with a shorter time to first treatment (log-rank test P=1* 10°), 
which indicates a more aggressive disease. The correlation with time to 
first treatment was significant even after adjusting for known prognostic 
markers, including disease stage (Binet stage), SF3B1 mutations and 
IGHV status”? (multivariate Cox model P= 0.043) (Fig. 4e, Extended 
Data Fig. 10b). However, we observed no difference in overall survival 
between cases of CLL with U1 mutation and wild-type U1 (Extended 
Data Fig. 10a). We noted that 7 out of 10 cases of CLL with U1 mutation 
involved early-stage disease (Binet stage A) (Extended Data Fig. 10c), 
a hint that the mutation may appear at an early phase of the disease. 

HCC has multiple risk factors, including infection with hepatitis B or C 
virus and heavy alcohol use”’. We found that the U1 mutation was associ- 
ated with increased alcohol intake (Mantel-Haenszel test, P= 0.0031) 
but not with infection with hepatitis B or C virus (Fig. 4f). The mutation 
was also associated with increased age at diagnosis, but not with survival 
(Fig. 4g, Extended Data Fig. 10d-f). As in CLL, the mutation was also 
found in early disease stages of HCC (Extended Data Fig. 10g). 
Here we provide an example of recurrent mutations in a noncoding 
splicing factor across multiple types of cancer. Previously identified splicing 
factor mutations in SF3B1 and SRSF2 have been 
thought to promote tumorigenesis by inducing transcriptome-wide 
splicing changes that are subtle overall**. The same global effect is 
observed here for the U1 mutation. We also find mis-splicing of mul- 
tiple known or putative cancer genes in cases of CLL and HCC with U1 
mutations, which supports the theory that the tumorigenic effects of 
spliceosomal mutations are mediated by the production of specific 
aberrant isoforms”, although detailed functional analysis is required 
to confirm the role of these isoforms. 

The U1 mutation has potential clinical applications. Besides its use 
as an independent prognostic marker in CLL, the mutation may also 
represent an opportunity for treatment. Inhibitors of SF3B1 have previ- 
ously been demonstrated to preferentially kill tumour cells that contain 
splicing factor mutations via synthetic lethality”*”’; this may also work 
for tumours with U1 mutations. Alternatively, one might also target 
specific mis-spliced isoforms—such as the cell-surface protein CD44—via 
oligonucleotides or antibodies”*”*. Genomic regions such as the U1 gene 
locus described here are generally overlooked in cancer sequencing 
studies. Future driver discovery studies that focus on these difficult 
regions might discover additional noncoding cancer drivers. 
Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Data collection 

All samples used in this study were from participants recruited and 
anonymized by individual International Cancer Genome Consortium 
(ICGC) and The Cancer Genome Atlas (TCGA) projects. Written informed 
consent was obtained from all human participants through individ- 
ual projects. The PCAWG dataset consisting of 2,583 donors across 
37 tumour types was collected from the ICGC Data Coordination Center 
(ICGC DCC). For whole-genome sequencing, all tumour and paired 
normal aligned BAMs (n=5,166) were retrieved. The use of PCAWG data 
was approved by the University of Toronto Research Ethics Board under 
RIS Human Protocol Number 30278 and protocol title ‘Pan-cancer Analy- 
sis of Whole Genomes: PAWG’. 

For CLL, a total of 141 normal-tumour paired whole-genome sequenc- 
ing and 299 RNA-seq from 318 donors were used in this study (122 donors 
with both). This cohort included 29 cases of high-count monoclonal 
B cell lymphocytosis (MBL) of the CLL-type, an early phase of overt CLL. 
In addition to data from the PCAWG lymph-CLL cohort (n = 90) that 
originated from the CLLE-ES (Chronic Lymphocytic Leukaemia - Spain) 
project, we also incorporated additional CLLE-ES data deposited in the 
European Genome-phenome Archive (EGA) and ICGC DCC (data release 
27). All samples in this cohort had been studied before any treatment. 
The specific clinical and biological data of these cases have previously 
been described”. The use of genomic data, clinical data and CLL samples 
was approved by the Hospital Clinic of Barcelona Institutional Review 
Board under protocol number HCB/2015/0814 and protocol title ‘Func- 
tional and Clinical Impact of Genomic Analysis in CLL’. 

For HCC, 315 normal-tumour paired whole-genome sequencing 
and 387 tumour RNA-seq from 613 HCC donors were used in total 
(89 donors had both forms of data available). The PCAWG liver-HCC 
cohort (n= 315) included data from four projects: LICA-FR (n=5; Liver 
Cancer - France), LINC-JP (n= 28; Liver Cancer - National Cancer Center, 
Japan), LIRI-JP (n= 229; Liver Cancer - RIKEN, Japan) and LIHC-US (n=53; 
Liver Hepatocellular Carcinoma - TCGA, US). Additional HCC samples 
from the LIHC-US project were collected from the National Cancer 
Institute Genomic Data Commons (NCI GDC). 

All genomic data included in this study used GRCh37 as the reference 
genome and GENCODE v19 as the reference gene annotation”. 


Mutation calling for U1 

First, samples with insufficient coverage were flagged as genotype-undetermined 
and set aside for manual review. Coverage was defined as 
the median read depth at the 5’ splice-site recognition sequence of seven 
U1 genes. For 2,434 donors with sufficient coverage (median coverage >15 
in at least five U1 genes), all reads mapped by BWA MEM to U1 genes and 
pseudogenes as well as their flanking 1-kb regions were extracted with 
samtools and saved as miniBAMs”. These miniBAMs were then con- 
verted into paired FASTQ files and re-aligned with Bowtie2 (v.2.3.4.1) to 
GRCh37 in multiple mapping report mode (-k)”. Non-default parameters 
for Bowtie2 were ‘--score-min L,-0.3,-0.3 --no-mixed --no-discordant -k 100 
--very-sensitive’. Then, for each pair of multiple mapped reads, only 
alignments with minimal total edit distance (sum of edit distance in two 
mates) were kept. Reads mapped to U1 pseudogenes or other genomic 
regions were discarded. Next, for each re-aligned BAM, we counted the 
number of variant reads and the read depth (number of reference reads 
+ number of variant reads) at each position, separately for the forward and 
reverse strands. To account for multiple mapping, we performed an 
extra procedure only for the read depth counting: that is, when a read 
had k equally good alignments, we only counted it as 1/k read. We then 
used a beta-binomial error model trained on a project-specific panel 


of normal samples to call mutations, which was implemented with a 
modified version of EBCall®. Finally, we used IGV to manually curate all 
mutation calls and filtered out mutations that were supported by reads 
with multiple mismatches in the same gene, or that had three or more 
variant reads in the paired normal sample according to BWA MEM or 
Bowtie2 alignments™. To further minimize the false-negative rate for 
the g.3A>C mutation, we also assigned tumours that were called as wild 
type but that had two or more variant reads at the third base of any U1 
genes to the undetermined group. 
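
The alignment filtering and fractional read counting described above can be sketched as follows. This is a simplified illustration, not the authors' implementation: it tallies depth per gene rather than per position and strand, the input structure (a list of candidate alignments per read pair, each with a `target` and a summed `edit_distance`) is hypothetical, and it assumes that k counts every equally good alignment, including those that fall outside the U1 genes.

```python
from collections import defaultdict


def keep_best_alignments(alignments):
    """Keep only the alignments of one read pair whose total edit distance
    (summed over both mates) is minimal."""
    if not alignments:
        return []
    best = min(a["edit_distance"] for a in alignments)
    return [a for a in alignments if a["edit_distance"] == best]


def fractional_depth(read_pairs, u1_genes):
    """Accumulate read depth per U1 gene, counting a read pair with k
    equally good alignments as 1/k at each of them. Alignments to
    pseudogenes or other regions are discarded after weighting."""
    depth = defaultdict(float)
    for alignments in read_pairs:
        best = keep_best_alignments(alignments)
        k = len(best)
        for aln in best:
            if aln["target"] in u1_genes:
                depth[aln["target"]] += 1.0 / k
    return dict(depth)
```

Down-weighting ambiguous reads in the depth keeps near-identical U1 gene copies from each receiving full credit for the same read pair.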


RNA-seq data processing 

To analyse additional samples and PCAWG samples together, we uniformly 
processed additional CLL and HCC RNA-seq data with a slightly 
modified version of the PCAWG RNA-seq STAR 2-pass pipeline**”*. To 
maximize the sensitivity of novel junction discovery, we added a cus- 
tomized junction file that was extracted from PCAWG STAR alignments. 
Gene-level expression was counted by htseq-count (v.0.9.1)*”. Transcript- 
level expression was estimated by Kallisto (v.0.44.0)°*. The quality con- 
trol process was done with FastQC (v.0.11.7) and multiQC (v.1.5)””. The 
transcript integrity number was calculated with RSeQC (v.2.6.4)*®. In this 
study, we kept only RNA-seq data that met the following criteria: first, 
FASTQ files passed at least three main FastQC flags (overrepresented 
sequences, per base N content, per base sequence quality, per sequence 
GC content and per sequence quality scores); second, more than 50% of 
reads were uniquely mapped and the total number of reads mapped by 
STAR was greater than 1 million; third, the total number of fragments 
counted by htseq-count was greater than 5 million; and fourth, the tran- 
script integrity number was greater than 50. 
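
The four inclusion criteria can be expressed as a single predicate. The sketch below is illustrative only; the dictionary keys and the "pass" status string are hypothetical names chosen for this example, while the thresholds are those stated in the text.

```python
MAIN_FASTQC_FLAGS = (
    "overrepresented sequences",
    "per base N content",
    "per base sequence quality",
    "per sequence GC content",
    "per sequence quality scores",
)


def passes_rnaseq_qc(sample):
    """Return True if one RNA-seq sample meets all four criteria.

    `sample` is a dict with hypothetical keys:
      fastqc            -- mapping of FastQC flag name to status string
      unique_fraction   -- fraction of uniquely mapped reads (0 to 1)
      star_mapped_reads -- total reads mapped by STAR
      htseq_fragments   -- total fragments counted by htseq-count
      tin               -- transcript integrity number
    """
    flags_passed = sum(
        1 for flag in MAIN_FASTQC_FLAGS if sample["fastqc"].get(flag) == "pass"
    )
    return (
        flags_passed >= 3                          # first criterion
        and sample["unique_fraction"] > 0.5        # second criterion
        and sample["star_mapped_reads"] > 1_000_000
        and sample["htseq_fragments"] > 5_000_000  # third criterion
        and sample["tin"] > 50                     # fourth criterion
    )
```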


Differential splicing and expression analysis 

For intron-centric differential splicing analysis, the LeafCutter package 
was used to quantify intron usage and identify differentially spliced 
intron clusters between two conditions”. Splice junction files (SJ.out. 
tab) generated by STAR were used as input for LeafCutter. Only splice 
junctions supported by uniquely mapped reads and with at least 6-bp 
maximum overhang were used. An intron was considered significantly 
differentially spliced when q < 0.1 and absolute log2(effect size) > 1. 
The limma package was used for differential expression analysis”. 
Gene-level expression from htseq-count was used as input. A gene was 
considered significantly differentially expressed when q < 0.1 and 
absolute log2-transformed fold change > 1. For CLL, we used the IGHV 
status (U-CLL or M-CLL) as the covariate, and compared 11 tumours with 
U1 mutation and wild-type SF3B1, and 26 tumours with wild-type U1 and 
SF3B1 mutation, against 254 tumours with wild-type U1 and SF3B1. For HCC, 
we used project code (LIRI-JP or LIHC-US) as the covariate to control 
for batch effects and compared 20 tumours with U1 mutation with 367 
tumours with wild-type U1. We also used randomized comparisons as 
controls by permuting ‘MUT’ and ‘WT’ labels for both differential 
splicing and differential expression analysis. 
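
As a rough illustration of the significance calls and the randomized control described in this section, the sketch below applies the two stated thresholds and shuffles genotype labels. The function names and the fixed seed are assumptions made for the example; the statistics themselves came from LeafCutter and limma, not from code like this.

```python
import random


def is_significant(q_value, log2_effect, q_cutoff=0.1, effect_cutoff=1.0):
    """Apply the two thresholds stated in the text:
    q < 0.1 and absolute log2 effect size (or fold change) > 1."""
    return q_value < q_cutoff and abs(log2_effect) > effect_cutoff


def permute_labels(labels, seed=0):
    """Return a shuffled copy of the 'MUT'/'WT' labels for a randomized
    control comparison; group sizes are preserved."""
    shuffled = list(labels)
    random.Random(seed).shuffle(shuffled)
    return shuffled
```

Re-running the same differential analysis on permuted labels should yield few or no significant calls if the original signal is tied to genotype.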

We also performed exon-centric differential splicing analysis for 
U-CLL (6 cases with U1 mutation and wild-type SF3B1, 6 cases with 
wild-type U1 and SF3B1 mutation, and 30 cases with wild-type U1 and 
SF3B1) using the rMATS package (v.4.0.2) with default parameters”. 
Differential splicing events with q < 0.1 and absolute ΔPSI > 0.1 were 
considered significant. 


Inference of U1 g.3A>C status 

Separate models were built for CLL and HCC. Cross-validations were 
used to compare different models and settings (Supplementary Note). 
For CLL, we used RNA-seq data for 7 cases with U1 mutation and 60 cases 
with wild-type U1 as training data, and RNA-seq data from 232 cases 
as test data. For HCC, we used RNA-seq for 10 cases with U1 mutation 
and 60 cases with wild-type U1 as training data, and RNA-seq data from 
317 cases as test data. For splicing-based models, training data were 
used to identify differentially spliced introns (3,174 features for CLL and 
600 features for HCC) and the usage of these introns was then used to 
train a random forest classifier with 100 trees. For expression-based 
models, training data were used to identify differentially expressed 
genes (502 features for CLL) and normalized expression of these genes 
was then used to train a random forest classifier with 100 trees. Finally, 
t-distributed stochastic neighbour embedding (t-SNE) was used to verify 
and visualize all predictions”. 


Calculation of ABCD3 splice score 

The ABCD3 splice score for CLL was built with the number of uniquely 
mapped RNA-seq reads that support cryptic splice junctions (N_cryptic) 
or annotated splice junctions (N_annotated). In total, one annotated 
junction (chr1: 94,946,163-94,948,725) and four cryptic junctions (chr1: 
94,946,163-94,948,144, chr1: 94,946,163-94,946,964, chr1: 94,948,575- 
94,948,725 and chr1: 94,947,112-94,948,144) were used. Then, the score 
was calculated as follows: 


ABCD3 splice score = NOB Lies accateal aniistnaa® Menace} 


A high score (≥1) indicated that the patient with CLL was a carrier of the 
g.3A>C mutation. 


PCR-based SNP assay 

Genomic DNA from 298 primary samples was tested using custom 
rhAmp SNP assays (Integrated DNA Technologies). In brief, locus- and 
allele-specific primers were generated individually for RNU1_batch 
(RNU1-1, RNU1-2, RNU1-3, RNU1-4 and RNVU1-18) and RNU1_pseudo 
(RNU1-27P and RNU1-28P). Assays were run in technical triplicate in 5 
μl volume (DNA input of at least 10 ng), with control 
gBlocks for wild-type, mutant and heterozygous genotypes. Reporter 
mix used Yakima Yellow (mutant) and FAM (wild-type) dyes as well as 
ROX dye for passive reference. Plates were read on the StepOnePlus 
(Applied Biosystems) RT-PCR machine, and genotypes called using 
the StepOne v.2.3 software. The primer sequences are available in Sup- 
plementary Table 6. 


Cell lines and exogenous expression of the U1 g.3A>C mutation 
The pLKO.1-puro U6 sgRNA BfuAI stuffer lentiviral vector (Addgene) 
was modified by removing the internal U6 promoter (between NdeI and 
EcoRI), and replacing it with the U1 locus, including 393 bases of internal 
promoter, the U1 sequence and 39 bases of 3’-flanking region using 
the following oligonucleotides (U1-For_EcoRI: 
5’-GTCGAGAATTCTTGGCGTACAGTCTGTTTTTG; U1-Rev_NdeI: 
5’-CTATCATATGTAAGGACCAGCTTCTTTGGGA). The g.3A>C mutation was introduced 
by PCR with the following oligonucleotides (U1-A_C-for: 
5’-GCCAGGTAAGGATGAGATCTTCGGG; U1-A_C-rev: 
5’-CCCGAAGATCTCATCCTTACCTGGC) in combination with the corresponding 
previous primers. The PCR products 
were digested with NdeI and EcoRI, and cloned in the modified pLKO.1 
plasmid. All plasmids were verified by Sanger sequencing. 

CLL cell lines JVM3 and HG3 were grown in RPMI 1640, 10% FBS, 1% 
PSG and 1% AA; MEC1 was grown in IMDM, 10% FBS, 1% PSG and 1% AA; 
and the HEK-293T cell line was grown in DMEM, 10% FBS, 1% PSG. CLL 
cell lines HG3, MEC1 and JVM3 were obtained from DSMZ 
(https://www.dsmz.de/catalogues/catalogue-human-and-animal-cell-lines.html). The 
authenticity of the cell lines was tested with the AMpFLSTR Identifiler 
Plus PCR Amplification Kit. CLL cell lines have tested negative for myco- 
plasma. For the production of lentiviral particles, the protocol from the 
manufacturer (Addgene) was used with minor modifications. Thus, HEK- 
293T cells (5 x 10° cells) were cultured in 10-cm plates and transfected 
using Lipofectamine Plus (Invitrogen) with 2 μg of either pLKO.1-U1(WT) 
(containing the wild-type U1 locus) or pLKO.1-U1(g.3A>C) (containing the 
g.3A>C mutation), together with 1 μg of psPAX2 packaging plasmid 
and 1 μg of pMD2.G envelope plasmid. Twelve hours after transfection, 
the medium was replaced with complete medium, and 24 h later 10 
ml of supernatant was filtered (0.45 μm), and 4 ml was used to infect 


CLL cell lines in the presence of 8 μg/ml polybrene. The infection was 
repeated 24 h later, and after 24 h cells were plated in complete medium 
for one day, and then selected with 1.2 μg/ml of puromycin. Cells were 
selected for four days, and total RNA was extracted with the Trizol 
method. 


Verification of the expression of the U1 g.3A>C mutation by 5’ 
rapid amplification of cDNA ends 

Rapid amplification of cDNA ends (RACE) was performed using 1 μg 
of total RNA from JVM3, HG3 or MEC1 cell lines infected with either 
pLKO.1-U1(WT) or pLKO.1-U1(g.3A>C) following the recommendations of the 
manufacturer (Sigma-Aldrich), and the following specific oligonucleo- 
tides (U1-RACE_SP1: 5’- CAGGGGAAAGCGCGAACGCAGT; U1-RACE_SP2: 
5’- CCCACTACCACAAATTATGC). A single amplification band of the 
expected size (160 bp) was excised from the gel, purified and sequenced 
with the internal oligonucleotide U1-RACE_SP2. 


RNA-seq and data analysis for CLL cell lines 

In total, 12 libraries, comprising 2 technical replicates for each of the 
3 cell lines (JVM3, HG3 and MEC1) and 2 conditions (mutation or wild 
type), were prepared as stranded total RNA-seq libraries and then 
sequenced with the Illumina HiSeq 4000 system (2 x 76 bp) with >40 
million paired-end reads per sample. Cell line RNA-seq data were pro- 
cessed and analysed the same way as were primary tumour RNA-seq 
data. For differential splicing analysis, the same set of intron clusters 
used in primary CLL was tested in cell lines, so that their results were 
directly comparable. For the overlap test, a one-tailed Fisher’s exact 
test was used. 


RT-PCR and qPCR validation of mis-splicing events in primary 
CLL 

For PCR with reverse transcription (RT-PCR), RNA was obtained for 
samples from 14 patients with CLL, including 6 cases of U-CLL with U1 
mutation, 4 cases of U-CLL with wild-type U1 and 4 cases of M-CLL with 
wild-type U1. cDNA was synthesized using the iScript cDNA Synthesis Kit 
(Bio-Rad 1708890). PCRs were performed using 1 μl cDNA and the Taq 
PCR Master Mix Kit (Qiagen 201445) using 35 cycles. Products were run 
on the QIAxcel Advanced System (Qiagen). For quantitative PCR (qPCR), 
the same set of CLL samples was used except that one case of U-CLL with 
U1 mutation was exhausted. qPCR was performed using 1 μl cDNA and 
the PowerUp SYBR Green Master Mix (Applied Biosystems A25742) in 
duplicates in a StepOnePlus Real-Time System (Applied Biosystems). 
Relative quantification was analysed with the 2^(−ΔΔCt) method using GAPDH 
as the endogenous control. The primer sequences are available in Sup- 
plementary Table 6. 
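
For orientation, the standard comparative Ct calculation can be written as below. This is a generic sketch of that method, assuming GAPDH as the reference gene as stated in the text; which sample serves as the calibrator is not specified here, so the calibrator arguments are an assumption of this example.

```python
def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_calibrator, ct_ref_calibrator):
    """Relative quantification by the comparative Ct method.

    delta Ct       = Ct(target) - Ct(reference gene, e.g. GAPDH)
    delta delta Ct = delta Ct(sample) - delta Ct(calibrator)
    result         = 2 ** (-delta delta Ct)
    """
    delta_sample = ct_target_sample - ct_ref_sample
    delta_calibrator = ct_target_calibrator - ct_ref_calibrator
    return 2.0 ** -(delta_sample - delta_calibrator)
```

A target that crosses threshold three cycles earlier in the sample than in the calibrator, at equal reference Ct, corresponds to an eightfold higher relative level.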


Gene-set overrepresentation and enrichment analysis 

We identified gene lists that were significantly overrepresented in differ- 
entially spliced genes from Gene Ontology, Kyoto Encyclopedia of Genes 
and Genomes and Reactome databases using g:Profiler** *”. We also con- 
ducted gene-set enrichment analysis (GSEA) for differentially expressed 
genes using pre-ranked gene lists ordered by −log10(P value) × (sign of 
fold change)”. Both classical and weighted enrichment statistics were 
used in GSEA. For GSEA, we focused on C2 (curated) and C5 (Gene Ontol- 
ogy) gene sets in the Molecular Signatures Database (MSigDB v.6.2)**. 
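
The ranking metric used to build the pre-ranked lists can be sketched as follows. The input format (a mapping from gene name to a P value and a log2 fold change) and the function names are hypothetical, and the sketch assumes every P value is strictly greater than zero.

```python
import math


def preranked_score(p_value, log2_fold_change):
    """Ranking metric from the text: -log10(P value) x sign of fold change.
    Assumes p_value > 0."""
    sign = (log2_fold_change > 0) - (log2_fold_change < 0)
    return -math.log10(p_value) * sign


def rank_genes(stats):
    """Order genes from most significantly upregulated to most
    significantly downregulated.

    stats -- dict mapping gene name to (p_value, log2_fold_change)
    """
    return sorted(stats, key=lambda g: preranked_score(*stats[g]), reverse=True)
```

Genes with weak evidence in either direction end up in the middle of the list, which is what the running enrichment statistic in a pre-ranked analysis relies on.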


Mutual-exclusivity and co-occurrence analysis 

We collected lists of CLL and HCC driver alterations from the 
literature””’. All CLL samples and whole-genome-sequenced HCC samples 
were used in the analysis. HCC samples were analysed separately based 
on project (LIRI-JP or LIHC-US), and as a combined cohort. To determine 
the pairwise significance between the U1 mutation and other driver 
events, we used the Cochran-Mantel-Haenszel χ² test for the combined 
HCC cohort, and Fisher’s exact test for each project. As the detection 
of TERT promoter mutations was underpowered in many PCAWG HCC 


samples (especially for the LIRI-JP project), we also included rescued 
TERT promoter mutations as previously described”. 


Clinical data analysis 

All clinical data analysed here have previously been described””°*". 
Patient outcomes were analysed with the log-rank test for single variables 
and the Cox proportional hazards regression model for multiple variables. 
For CLL, we analysed overall survival and time to first treatment from 
the time of sampling. Cases with MBL were not included in the outcome 
analysis. For HCC in LIRI-JP, we only analysed overall survival. For HCC in 
LIHC-US, we analysed two endpoints (overall survival and progression- 
free interval) as recommended by the TCGA PanCancer Atlas”. Age at 
diagnosis between mutated and unmutated groups was tested with two- 
sample Wilcoxon rank-sum tests. The association between U1 mutations 
and categorical patient characteristics (such as gender, IGHV status, 
infection with hepatitis B or C virus and alcohol history) was analysed 
with Fisher’s exact test. For alcohol history, two HCC projects used dif- 
ferent indicators. For LIHC-US, we used binary alcoholic liver disease 
history. For LIRI-JP, we collapsed its four-level alcohol intake indicators 
(a, no alcohol intake; b, social drinker; c, about 60 g every day; d, 60 g 
and more every day) into a binary factor (0, a and b; 1, c and d). 
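
To make the categorical association test concrete, the sketch below implements a two-sided Fisher's exact test for a 2 × 2 table using only the standard library. It is an illustrative implementation, not the software used in the study; the function name and argument order are assumptions, and the two-sided P value is defined here as the sum of the probabilities of all tables that are no more likely than the observed one.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2 x 2 table [[a, b], [c, d]],
    for example mutated/unmutated counts in two patient groups."""
    row1, row2, col1 = a + b, c + d, a + c
    n = row1 + row2

    def prob(x):
        # Hypergeometric probability of x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    total = sum(
        p for p in (prob(x) for x in range(lo, hi + 1))
        if p <= p_obs * (1 + 1e-7)  # tolerance for floating-point ties
    )
    return min(1.0, total)


# Example using the IGHV subtype counts reported in the main text:
# 12 of 105 U-CLL cases and 0 of 173 M-CLL cases carried the mutation.
p_ighv = fisher_exact_two_sided(12, 93, 0, 173)
```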


Statistical analysis 

All statistical tests were two-sided unless otherwise stated. All statistical 
methods are described in the corresponding sections and P < 0.05 was 
considered significant when only a single test was performed. All 
false-discovery rate controls were conducted with the Benjamini-Hochberg 
procedure, and a false-discovery rate of 10% (q < 0.1) was selected as the 
significance threshold. 
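
The Benjamini-Hochberg adjustment can be sketched in a few lines. This is a generic textbook implementation offered for illustration, not the code used in the study; the function name is an assumption.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values (q values),
    in the same order as the input."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    q_values = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        q_values[i] = running_min
    return q_values


# Tests with q < 0.1 would be called significant at a 10% false-discovery rate.
q_example = benjamini_hochberg([0.01, 0.04, 0.03, 0.5])
```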


Code availability 


All published computational programs used in this study are indicated in 
corresponding sections. Scripts used to perform U1 snRNA mutation 
calling are available at https://github.com/smshuai/U1-snRNA. 


Data availability 


PCAWG data are available at ICGC DCC (https://docs.icgc.org/pcawg/data/; 
donor identifiers in Supplementary Table 1). Additional CLL 
data (donor identifiers in Supplementary Table 3) are available at 
ICGC DCC (https://dcc.icgc.org/releases/release_27/Projects/CLLE-ES) 
and EGA (raw data under accession numbers EGAS00001000374 
and EGAS00001001306). Additional HCC data are available at GDC 
Data Portal (raw and processed data under project code TCGA-LIHC; 
donor identifiers in Supplementary Table 3). CLL cell line RNA-seq data 
are available at GSE134197. 
Extended Data Fig. 1 | Overview of genotyping methods and results. a, 
Computational pipeline used to identify somatic mutations in U1 genes from 
whole-genome sequencing data. b, Computational pipeline used to infer the U1 
g.3A>C status on the basis of splicing patterns. c, Computational pipeline used 
to infer the U1 g.3A>C status on the basis of expression patterns. d, e, U1 g.3A>C 
status for 318 CLL (d) and 613 HCC samples (e). Consensus genotypes are used in 
downstream analysis. One hundred and three undetermined HCC samples are 
excluded from downstream analysis. Grey indicates data not available. 
Extended Data Fig. 2 | Global transcriptomic changes in CLL and HCC. a, 
Category of mis-spliced introns. Black boxes indicate exons and lines indicate 
introns; red lines indicate actual splice junctions. A junction is described as 
‘annotated’ if the junction matches any annotated introns. A junction is 
described as ‘novel annotated’ if both splice sites are annotated but not paired. A 
cryptic 5’ splice site junction exists if only the 3’ splice site is annotated. A cryptic 
3’ splice site junction exists if only the 5’ splice site is annotated. b, Euler plots of 
mis-spliced intron clusters in CLL and HCC. Increased excision and decreased 
excision represent intron clusters that have significantly mis-spliced introns 
with ΔPSI > 0 and ΔPSI < 0, respectively. c, 5’ splice site for introns with increased 
(n=239) or decreased excision (n=294) in HCC with U1 mutation. Top, bar plot 
shows Benjamini-Hochberg adjusted P values from χ² tests; red line indicates q 
= 0.1. d, Category of mis-splicing events in HCC with U1 mutation. e, P value 
quantile-quantile plots for differential expression analysis. Global 
gene-expression changes cannot be detected in the randomized dataset. P values are 
from limma. Biologically independent patient samples are used for CLL (11 cases 
with U1 mutation versus 254 cases with wild-type U1) and HCC (20 cases with U1 
mutation versus 367 cases with wild-type U1). f, g, t-SNE plots showing the 
accuracy of transcriptome-based inferences. f, t-SNE coordinates are computed 
from differentially spliced introns, and predictions are made with the 
splicing-based classifier. g, t-SNE coordinates are computed from differentially 
expressed genes, and predictions are made with the expression-based classifier. 
isoforms for MSI2, POLD1 and ABCD3 in CLL with U1 mutation. a, Coding without the g.3A>C mutation, respectively. In the box plot, centre line, box 
Extended Data Fig. 4 | Validation of mis-splicing in MSI2, POLD1, CD44 and 
ABCD3. a, Junctions used in RT-qPCR validations. For MSI2, POLD1 and ABCD3, 
one annotated junction and one cryptic junction are used. For CD44, one CD44s 
junction and one CD44v junction are used. Cryptic and CD44v junctions should 
be over-excised in samples with U1 mutation. b, RT-PCR result for junctions 
shown in a. MUT (n=6) and WT (n=8) represent independent CLL samples with 
and without the g.3A>C mutation, respectively. GAPDH is used as control. 
The length of all PCR products is shown on the right. The experiment was 
conducted once. For gel source data, see Supplementary Fig. 1. c, RT-qPCR 
result for junctions shown in a. Expression of cryptic and CD44v junctions is 
significantly higher in CLL samples with U1 mutation, as expected. P values are 
from two-sided Wilcoxon rank-sum tests. MUT (n=5) and WT (n=8) represent 
biologically independent CLL samples with and without the g.3A>C mutation, 
respectively. 
Extended Data Fig. 5 | Genes and pathways altered in CLL with U1 mutation. 
a, c, Representative upregulated (a) and downregulated (c) gene sets in CLL with 
U1 mutation (n=11 independent patients). For a, c, enrichment score (ES), 
normalized enrichment score (NES) and false-discovery rate (FDR) q are from 
the permutation test of GSEA. Genes in the x axis are sorted from the most 
significantly upregulated gene to the most significantly downregulated gene. 
The dashed line indicates fold change = 1. b, Expression box plots for TERF2 and 
TERT in the telomere maintenance pathway. q values are from limma. MUT 
(n=11) and WT (n=254) represent biologically independent CLL samples with 
and without the g.3A>C mutation, respectively. d, e, Expression heat map of 
ribosomal genes and BCR genes. d, Only the top 10 differentially expressed 
ribosomal genes are shown. e, Right, bar charts show the log2-transformed fold 
change and −log10(q value) from limma (n=11 independent samples of CLL with 
U1 mutation; n=254 independent samples of CLL with wild-type U1). 
f, Expression box plot for TP53. TP53 MUT/DEL (n=15 independent patients) 
represents CLL samples with TP53 mutation and/or deletion; U1 MUT 
(n=10 independent patients) represents samples with the g.3A>C mutation; WT 
(n=273 independent patients) represents the rest of the CLL samples. One 
sample with both TP53 and g.3A>C mutation is excluded. P values are from 
two-sided Wilcoxon rank-sum tests. g, Expression box plots for genes in the NF-κB 
(CHUK and IKBKB), PI3K/AKT (PIK3CA and AKT1) and MAPK/ERK (JUN) pathways. 
These pathways are downstream effectors of the BCR pathways. q values are 
from limma. MUT (n=11) and WT (n=254) are the same as in b. In all box plots, 
centre line, box limits, whiskers and points represent median, 25th and 75th 
percentiles, 1.5× interquartile range and individual samples, respectively. 


[Extended Data Fig. 6 image. Panel a: 5’ RACE electropherograms for JVM3, HG3 and MEC1 cells carrying U1 WT or U1 g.3A>C. Panels b-h: 5’ splice-site composition for introns with increased or decreased excision; over-excised G6 5’SS introns, up-regulated genes and down-regulated genes compared between patient and cell line; log2FC against mean log2 RPKM for JVM3, HG3 and MEC1.]

Extended Data Fig. 6 | See next page for caption. 
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Extended Data Fig. 6 | Exogenous expression of the g.3A>C mutation in cell 
lines induced transcriptome-wide changes. a, 5’ RACE confirming the
expression of the U1 g.3A>C mutation in three CLL cell lines. CLL cell lines (JVM3,
HG3 and MEC1) are infected with lentiviral particles that contain the U1 locus with or
without the g.3A>C mutation (g.3A>C or WT, respectively). The
electropherograms correspond to the sequence of the PCR product
(reverse-strand). The arrowheads indicate the location of the 3rd base of U1. The
location of the 5’ RACE primer is also indicated. The experiment was conducted once.
b, 5’ splice site for introns with increased or decreased excision in CLL cell lines
with U1 mutation (n = 3 biologically independent cell lines). Top, bar chart shows
q values from χ² tests for base composition difference; red line indicates the
q = 0.1 cutoff. c, Category of mis-splicing events in CLL cell lines. Extended Data
Figure 2a provides the definition of each category. The number of introns is
coloured by the sixth base of the 5’ splice site. d, Volcano plot for differential
expression analysis of CLL cell lines. q values are from limma. n = 3 biologically
independent cell lines; each cell line has a form with (MUT) and without (WT) U1
mutation. e-g, Euler plots comparing differentially spliced introns (e) and
differentially expressed genes (f, g) between primary CLL from patients and cell
lines. e, Over-excised introns with the G6 5’ splice site that are direct targets of
the U1 mutation are compared. P values are from one-tailed Fisher’s exact test.
The expected number of overlaps is also shown. h, Asymmetric changes in gene
expression for cell lines. In all three cell lines, more genes are upregulated
(log2-transformed fold change > 1) than downregulated (log2-transformed fold
change < −1).
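The expected overlap and the one-tailed Fisher’s exact test mentioned in panels e-g can be sketched as follows. The caption does not give the set sizes or the background universe, so every number below is hypothetical, and the function name `overlap_test` is an illustration only. The sketch uses the fact that a one-tailed Fisher’s exact test for enrichment is the upper tail of a hypergeometric distribution.

```python
from math import comb


def overlap_test(n_universe, n_a, n_b, n_overlap):
    """Expected overlap of two sets and one-tailed (enrichment) P value.

    n_universe: size of the background (for example, all tested introns)
    n_a, n_b:   sizes of the two sets being compared
    n_overlap:  observed number of shared items
    """
    expected = n_a * n_b / n_universe
    denom = comb(n_universe, n_b)
    # P(X >= n_overlap) when n_b items are drawn from a universe with n_a hits.
    tail = sum(
        comb(n_a, k) * comb(n_universe - n_a, n_b - k)
        for k in range(n_overlap, min(n_a, n_b) + 1)
    )
    return expected, tail / denom


# Hypothetical sizes (not from the study): 10,000 tested introns,
# 300 over-excised in patients, 200 in cell lines, 40 shared.
expected, p_value = overlap_test(10_000, 300, 200, 40)
print(f"expected overlap = {expected:.1f}")
```

An observed overlap far above the expected value gives a small P value, which is how the Euler plots in these panels are annotated.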
Extended Data Fig. 7 | Driver events and clinical features of CLL. Left,
oncoprint shows 36 CLL driver alterations across 313 patients with CLL. Clinical
classification (CLL or monoclonal B cell lymphocytosis), gender, IGHV status and
U1 g.3A>C status are shown on the top. Right, bar plot shows q values from
two-sided Fisher’s exact test that compares each alteration with the U1 g.3A>C status.
Extended Data Fig. 8 | Consequence of SF3B1 mutation and its relation to the
U1 g.3A>C mutation in CLL. a, Category of mis-splicing events in CLL with SF3B1
mutation. Unlike the g.3A>C mutation, SF3B1 mutation induces more cryptic
3’ splice site changes. b, c, Sequence motifs of 5’ splice site (b) and 3’ splice site
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Extended Data Fig. 10 | Clinical analysis of CLL and HCC. a, Kaplan-Meier plot 
of overall survival for CLL. P value is from the Cox model. b, Independent
prognostic value of the U1 g.3A>C mutation in CLL. Multivariate Cox model
results for time to first treatment in 301 patients with CLL (number of
events = 153). HR, hazard ratio; CI, confidence interval. c, U1 mutation frequency
by disease stage in CLL. A and B/C, Binet A and B/C stages. d, Kaplan-Meier plot
of overall survival for HCC samples in the LIRI-JP project. MUT indicates samples
with the g.3A>C mutation and WT indicates samples without the g.3A>C
mutation; 13 MUT and 116 WT samples are used. e, f, Kaplan-Meier plots of OS (e)
and progression-free interval (f) for HCC samples in the LIHC-US project. MUT
indicates samples with the g.3A>C mutation and WT indicates samples without
the g.3A>C mutation; 15 MUT and 335 WT samples are used. For d-f, P values are
from two-sided log-rank tests. g, U1 mutation frequency by disease stage in HCC.
For LIRI-JP, staging uses the Liver Cancer Study Group of Japan system. For
LIHC-US, staging uses the American Joint Committee on Cancer system.
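For readers unfamiliar with the Kaplan-Meier plots referred to in this caption, the estimator behind them can be sketched in a few lines. This is a minimal illustration on hypothetical follow-up data, not the analysis code used in the study; the function name `kaplan_meier` and the example values are assumptions.

```python
def kaplan_meier(times, events):
    """Kaplan-Meier survival estimate.

    times:  follow-up time for each sample
    events: 1 if the event (for example, death) was observed, 0 if censored
    Returns a list of (time, survival probability) at each distinct event time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = sum(1 for tt, e in data if tt == t and e == 1)
        leaving = sum(1 for tt, _ in data if tt == t)  # events plus censored at t
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving
        i += leaving
    return curve


# Hypothetical follow-up times in months (illustration only).
curve = kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1])
for t, s in curve:
    print(t, s)
```

Censored samples leave the risk set without lowering the curve, which is why the step sizes grow as fewer samples remain at risk.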
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Data collection No code was used for data collection. 


Data analysis The following published software was used in this study: Bowtie2 (v2.3.4.1); STAR (v2.5.4); htseq-count (v0.9.1); Kallisto (v0.44.0);
multiQC (v1.5); RSeQC (v2.6.4); FastQC (v0.11.7); rMATS (v4.0.2); leafCutter (v0.2.7); StepOne (v2.3); limma (v3.38.3). Customized code
for mutation calling has been deposited into GitHub (https://github.com/smshuai/U1-snRNA).
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- Accession codes, unique identifiers, or web links for publicly available datasets 
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- Adescription of any restrictions on data availability 


PCAWG data are available at ICGC DCC (https://docs.icgc.org/pcawg/data/). Additional CLL data are available at ICGC DCC (https://dcc.icgc.org/releases/release_27/ 
Projects/CLLE-ES) and EGA (raw data under accession numbers EGAS00001000374 and EGAS00001001306). Additional HCC data are available at GDC Data Portal
(raw and processed data under project code TCGA-LIHC). CLL cell line RNA-Seq data are available at GSE134197. 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size No sample size calculation was performed. For genomic and clinical data analysis, we used all available data. For the rhAMP assay, we tested 
all but one CLL sample with RNA sequencing data (n=298). For RT-PCR, we randomly selected at least four samples per condition, which is 
sufficient for a two-sample Wilcoxon test.
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Data exclusions To minimize false negatives, we excluded 103 HCC patients in the LIRI-JP project due to undetermined status for the U1 g.3A>C mutation. 
These samples do not have enough coverage for the U1 genes AND have no RNA-Seq data. These samples were excluded before any 
downstream splicing/expression/clinical analysis.


Replication For the rhAMP assay, we used technical triplicates. One donor which showed inconsistent results for rhAMP was reported in the paper. For 
the RT-PCR, we used 6 U1 mutated samples and 4 wild-type samples for each CLL subtype as biological replicates; results were consistent. For 
qPCR, we used 6 U1 mutated samples and 4 wild-type samples for each CLL subtype as biological replicates; results were consistent. For cell 
line experiments, we used JVM3, HG3 and MEC1 as biological replicates and each condition (MUT or WT) was sequenced twice as technical
replicates; results were consistent so we merged RNA-Seq reads from technical replicates. 


Randomization For differential expression and splicing analysis, we used randomized genotype labels (MUT or WT) as controls.


Blinding Not relevant in this study as we studied the effect of a mutation from previously established patient cohorts.
Emerging evidence suggests that epigenetic regulation is dependent on metabolic
state, and implicates specific metabolic factors in neural functions that drive
behaviour’. In neurons, acetylation of histones relies on the metabolite acetyl-CoA,
which is produced from acetate by chromatin-bound acetyl-CoA synthetase 2 (ACSS2)’.
Notably, the breakdown of alcohol in the liver leads to a rapid increase in levels of blood 
acetate’, and alcohol is therefore a major source of acetate in the body. Histone 
acetylation in neurons may thus be under the influence of acetate that is derived from 
alcohol’, with potential effects on alcohol-induced gene expression in the brain, and on 
behaviour’. Here, using in vivo stable-isotope labelling in mice, we show that the 
metabolism of alcohol contributes to rapid acetylation of histones in the brain, and 
that this occurs in part through the direct deposition of acetyl groups that are derived 
from alcohol onto histones in an ACSS2-dependent manner. A similar direct deposition 
was observed when mice were injected with heavy-labelled acetate in vivo. In a
pregnant mouse, exposure to labelled alcohol resulted in the incorporation of labelled 
acetyl groups into gestating fetal brains. In isolated primary hippocampal neurons ex 
vivo, extracellular acetate induced transcriptional programs related to learning and 
memory, which were sensitive to ACSS2 inhibition. We show that alcohol-related 
associative learning requires ACSS2 in vivo. These findings suggest that there is a direct 
link between alcohol metabolism and gene regulation, through the ACSS2-dependent 
acetylation of histones in the brain. 


To determine whether acetate that is produced from the breakdown of 
alcohol contributes to dynamic acetylation of histones in the brain, we 
used in vivo stable-isotope labelling of protein acetylation, monitored 
by mass spectrometry’ (Fig. 1a). Acetyl groups derived from ethanol 
were rapidly incorporated into histone acetylation in the brain, both in
the hippocampus (Fig. 1b) and in the prefrontal cortex (Extended Data
Fig. 1c). The incorporation of labelled alcohol into histone acetylation 
was dynamic in the hippocampus, and heavy labelling decreased to base- 
line levels 8 h after intraperitoneal injection (Extended Data Fig. 2a, b). 
This rapid in vivo labelling of histone acetylation in the hippocampus was 
absent in mice that were injected with non-labelled alcohol (Extended 
Data Fig. 2d). Similar labelling occurred in the liver (Fig. 1c), which is the
principal site of alcohol metabolism and expresses high levels of ACSS2”°. 
By contrast, the abundance of labelled acetylated histones was lower 
in skeletal muscle (gastrocnemius muscle; Extended Data Figs. 1d, 2e), 
which has relatively lower ACSS2 expression levels’. 

To test whether ACSS2 is required for the incorporation of alcohol- 
derived acetate into the brain, we attenuated the expression of ACSS2 
in the dorsal hippocampus (dHPC) by shRNA knockdown using a pre- 
viously validated viral vector’. In these ACSS2-knockdown mice, we 
compared alcohol-derived histone acetylation separately in the dHPC, 
in which ACSS2 was reduced, and in the ventral hippocampus (vHPC),
in which ACSS2 expression was not affected. Notably, knockdown of
ACSS2 prevented the incorporation of heavy-labelled acetyl groups 
derived from alcohol into histone acetylation in the dHPC (Fig. 2a). 
By contrast, in the same mouse, incorporation of the heavy label into 
histone acetylation in the vHPC was not affected (Fig. 2b). These in vivo
data indicate that acetate that is derived from the metabolism of alcohol 
in the liver is transported to the brain and readily incorporated into 
ACSS2-dependent histone acetylation (see Supplementary Information 
for further discussion). 

Although the majority of alcohol metabolism takes place in the liver, 
alcohol fractions may also be converted to acetate in the brain’® by the 
enzymes catalase and cytochrome P450 2E1 (CYP2E1). We therefore 
assessed the contribution of acetyl groups that are derived from extra- 
cellular acetate to histone acetylation in the brain. In mice that were 
intraperitoneally injected with 2 g kg⁻¹ deuterated acetate (acetate-d3),
the labelled acetate was rapidly incorporated into histone acetylation in
the brain, at similar levels in both hippocampus and cortex (Extended 
Data Fig. 2f, g). Relative levels of the labelled acetate were highest at 
30 min and returned to background levels at 4 h after injection, indi- 
cating that acetate-derived acetyl groups were quickly incorporated 
into brain histone acetylation and showing the rapid turnover of this 
process. Notably, we found that levels of acetate in the hippocampus 
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Fig. 1| Alcohol metabolites contribute to histone acetylation in the brain. 

a, Experimental outline of in vivo ethanol-d6 mass spectrometry. Asterisk
indicates the isotopic shift owing to label incorporation, corresponding to the
d3-labelled form of the acetylated peptide. HPC, hippocampus. b, Metabolized
heavy ethanol-d6 is incorporated into histone acetylation in the hippocampus.
The axis of the Arachne plot represents the percentage of the third isotope for
the acetylated peptide, corresponding to the d3-labelled form; the natural
relative abundance of that isotope is apparent in the ‘none’ and ‘saline 1 h’
treatment groups. c, Label incorporation into histone acetylation occurs earlier
in the liver, the principal site of alcohol metabolism.


were significantly increased 30 min after injection of alcohol or acetate 
(Extended Data Fig. 2h), and we detected substantial amounts of heavy 
acetate in the hippocampus as early as 30 min after injection with deu- 
terated ethanol (ethanol-d6) (Extended Data Fig. 3a). 

We further investigated whether alcohol-derived carbon groups 
are incorporated into other key metabolites in hippocampal tissue. 
Although we detected no incorporation of the alcohol-derived label 
into pools of glucose and 3-hydroxybutyrate, and only a fraction (less 
than 1%) into the pool of lactate, we found that alcohol labels pools of 
glutamine in the hippocampus (Extended Data Fig. 3b-e). In the brain,
de novo synthesis of glutamine in astrocytes replenishes the glutamate— 
glutamine cycle, in which glutamine is trafficked into glutamatergic 
neurons for production of the neurotransmitter glutamate. Citrate, the
substrate that is used by ATP citrate lyase to produce nucleo-cytoplasmic
acetyl-CoA, is generated from α-ketoglutarate, which can be derived
from carboxylation of glutamine; this path could therefore provide 
another route through which alcohol contributes to histone acetylation. 
However, we detected only traces of the alcohol-derived label in pools of 
citrate or isocitrate in the hippocampus (Extended Data Fig. 3f). Taken 
together with our mass spectrometry data in ACSS2-knockdown mice 
(Fig. 2a, b), these results support the view that alcohol-derived acetate 
that contributes to the acetylation of histones in the hippocampus is 
converted to acetyl-CoA directly by ACSS2. Accordingly, our data sug- 
gest that the increased levels of blood acetate from alcohol metabolism 
promote ACSS2-mediated dynamic acetylation of histones in the brain. 

We examined the functional relevance of alcohol-derived acetate for 
ACSS2-dependent histone acetylation in regulating the expression of 
genes in the hippocampus. We found that treating wild-type mice with 
alcohol resulted in significant enrichment of peaks for K9-acetylated 
histone H3 (H3K9ac) and H3K27ac—both at key neuronal genes and 
genome-wide—and that this enrichment was greatly attenuated in 
ACSS2-knockdown mice (Fig. 2c-g; chromatin immunoprecipitation 
followed by sequencing (ChIP-seq) performed 1 h after alcohol injection).
For example, we observed ACSS2-dependent and alcohol-induced
histone acetylation at Fstl1 (follistatin-like 1) (Fig. 2c), a neuronal gene
that has been implicated in the development and migration of neurons", 
and alcohol-induced enrichment of H3K27ac at Cep152 (centrosomal 
protein of 152 kDa) (Extended Data Fig. 4a), an important regulator of 
genome integrity that is recurrently mutated in intellectual developmental
disorders and microcephaly”. Another example is Uimc1
(ubiquitin-interaction-motif-containing 1) (Extended Data Fig. 4b), which has
previously been linked with neurodevelopmental disorders and autism”. 
Evaluating the ChIP-seq data for histone acetylation on a genome-wide
scale, we found that 74% of the H3K9ac peaks that changed after expo- 
sure to alcohol were increased (339 out of 458 changed peaks called with 
MACS2; 10% false discovery rate (FDR) used as the significance threshold 
for DiffBind; Fig. 2d), and that 60% of the differential H3K27ac peaks 
were increased by alcohol (490 out of 816 peaks; ChIP-seq performed 1 h
after alcohol injection; Fig. 2e). Notably, this response was eliminated
in ACSS2-knockdown mice: 98% of the H3K9ac and H3K27ac peaks that 
increased in wild-type mice after alcohol treatment were not induced in 
the dHPC of ACSS2-knockdown mice (Fig. 2f, g). We then performed RNA 
sequencing (RNA-seq) to characterize the transcriptional response and 
found that H3K9ac and H3K27ac drove gene expression genome-wide 
in wild-type mice that were treated with alcohol (Extended Data Fig. 5a, 
b). However, in line with the ChIP-seq data, this response was blunted in 
ACSS2-knockdown mice (Extended Data Fig. 5c, d). A functional analysis 
of the genes that were both hyperacetylated and induced by alcohol in 
an ACSS2-dependent manner revealed enrichment of genes with func- 
tions in protein binding, cell junctions, postsynaptic density and drug 
response (Extended Data Fig. 5e, f). Together, our in vivo findings show
that treatment with alcohol leads to increased histone acetylation and 
transcriptional activity in the dHPC in an ACSS2-dependent manner. 
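The genome-wide percentages quoted in this paragraph follow directly from the peak counts reported in the text. A minimal worked check, using only those counts (the helper name `percent_increased` is an illustration):

```python
def percent_increased(n_increased, n_changed):
    """Percentage of differential peaks that gained acetylation."""
    return 100.0 * n_increased / n_changed


# Peak counts as reported in the text (MACS2 peaks, DiffBind at 10% FDR).
h3k9ac = percent_increased(339, 458)
h3k27ac = percent_increased(490, 816)
print(f"H3K9ac: {h3k9ac:.1f}%  H3K27ac: {h3k27ac:.1f}%")
```

Both values round to the 74% and 60% stated above.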

Because alcohol and acetate have pleiotropic effects on brain cir- 
cuitry and metabolism“, we developed an ex vivo assay to more closely 
model the direct effects of exogenous acetate on gene expression. We 
used isolated mouse primary hippocampal neurons, cultured for one 
week after isolation and subsequently treated with 5 mM acetate, to 
investigate the transcriptional response to supraphysiological levels of 
acetate that mimic the influx of exogenous acetate during alcohol intake. 
Furthermore, to determine the specific role of ACSS2 in transcriptional 
responses to acetate, we used a highly specific small-molecule inhibitor”’ 
of ACSS2 (ACSS2i; C20H18N4O2S2) (Extended Data Fig. 6a).

In primary hippocampal neurons, supplementation with acetate 
induced the expression of 3,613 genes (Fig. 3a, Extended Data Fig. 6b). 
Using an analysis of Gene Ontology (GO) terms, we found that these genes 
are involved in nervous system processes, including signal transduction 
and learning and memory (Extended Data Fig. 6c). By contrast, treatment 
with acetate resulted in the downregulation of genes that are involved 
in immune system processes (Extended Data Fig. 6d). In the presence of
ACSS2i, 2,107 of the genes that were induced by acetate were no longer 
upregulated (Extended Data Fig. 6f), indicating that acetate-induced 
transcription relies heavily on the catalytic activity of ACSS2. Notably, 
acetate-induced genes were not regulated by treatment with ACSS2i 
in the absence of acetate (Extended Data Fig. 6e). GO analysis of the 
upregulated genes that were sensitive to ACSS2i showed enrichment for 
nervous system processes, behaviour, learning and memory (Extended 
Data Fig. 6f) and specific genes showed sensitivity to ACSS2i (Extended 
Data Fig. 7a—d). For example, Slc17a7 was upregulated after treatment 
of wild-type hippocampus cells with acetate, but induction was diminished
when ACSS2 was inhibited (Extended Data Fig. 7a). Slc17a7 encodes
vesicular glutamate receptor 1, which is implicated in hippocampal synaptic
plasticity, addiction and alcohol use”. In addition, impaired DNA
methylation of Ccnjl (cyclin J-like) has been linked to prenatal alcohol
exposure and fetal alcohol spectrum disorder (FASD)"*° (Extended Data 
Fig. 7b). Further analysis revealed that the ACSS2i-sensitive and acetate- 
upregulated genes were also bound by hippocampal ACSS2 (which has 
been investigated by ChIP-seq previously’), and binding was proximal 
to the promoter at baseline without any direct behavioural stimulation 
in vivo? (Extended Data Fig. 8a). GO analysis linked these ACSS2 target 
genes to intricate plasticity-related mechanisms that involve axono- 
genesis and the activity of voltage-gated ion channels (Fig. 3b). Cor- 
respondingly, motif analysis of ACSS2-targeted, acetate-induced and 
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Fig. 2|Mass spectrometry analysis of ethanol-d6 in mice with knockdown of 
ACSS2 expression in the dHPC. a, Knockdown of ACSS2 expression in the dHPC 
(dKD) prevents the incorporation of the heavy label into histone acetylation. 
WT, wild type. b, In the same mouse, incorporation of the heavy label into
histone acetylation in the vHPC (where levels of ACSS2 are normal) is unchanged
compared to control mice. c, ChIP-seq for H3K9ac and H3K27ac in untreated 
and ethanol-treated wild-type and ACSS2-knockdown (KD) mice (n=3 
independent replicates). The genome-browser track view shows the Fstl1 locus 
(chr16: 37,776,000-37,793,000). d, e, ChIP-seq for H3K9ac (d) and H3K27ac (e) 
in vivo shows increased acetylation genome-wide after injection of ethanol (339 
out of 458 H3K9ac peaks and 490 out of 816 H3K27ac peaks called with MACS2; 
10% FDR used as significance threshold for DiffBind). P<2.2 x10 (d);
P=8.42x10™ (e) (two-sided Mann-Whitney rank-sum test). f, g, Induction of 
H3K9ac (f) and H3K27ac (g) is diminished in ACSS2-knockdown mice (458 
H3K9ac peaks, 816 H3K27ac peaks). P< 2.2 x10 (f); P=2.22 x 10° (g) (two-sided 
Mann-Whitney rank-sum test). Box plots show the values for the first and third 
quartiles (box limits) and the median (centre), with whiskers extending to
1.5× the interquartile range.


ACSS2i-sensitive genes implicated the involvement of neuronal transcription
factors, including E2F3 and NR5A2 (Fig. 3c), which have been linked
to neurodifferentiation and drug-related regulation of behaviour’. 
There was a substantial overlap between genes that were upregulated 
by alcohol in vivo in the dHPC and genes that were induced by acetate 
ex vivo—RNA-seq identified 830 alcohol-responsive hippocampal genes 
that overlapped with the ex vivo differentially expressed genes (Fig. 3d)— 
which suggests that translating our ex vivo model to the in vivo situation 
is valid. GO analysis for these overlapping genes highlighted the enrich- 
ment of genes that are related to neuronal plasticity, including those with 
roles in synapses, neuron projection and axons; however, genes related
to ribosomal and mitochondrial functions were also enriched (Extended 
Data Fig. 8b). Notably, a previously published microarray dataset of 
hippocampal genes that are regulated by alcohol in vivo also showed 
substantial overlap with our list of genes that are induced by acetate ex 
vivo (81 of 214 (38%) alcohol-responsive hippocampal genes in the
microarray”). Next, we showed (starting from a complementary analysis of our
in vivo data) that target genes of ACSS2 in the hippocampus that show 
alcohol-induced H3K9ac in vivo were also upregulated by treatment of 
hippocampal neurons with acetate ex vivo, and that ACSS2i blocks this 
gene induction (Fig. 3e). The equivalent relationship existed for hip- 
pocampal genes that show alcohol-induced H3K27ac in vivo; these genes 
were not induced by acetate ex vivo in the presence of ACSS2i (Fig. 3f). 
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Fig. 3 | Acetate-induced transcription mediated by ACSS2 in primary 
hippocampal neurons. a, RNA-seq in primary hippocampal neurons that were 
isolated from C57/Bl6 mouse embryos and treated with acetate (5 mM) in the
presence of ACSS2i or vehicle (DMSO). The first two columns of the heat map
show 7,600 genes that were differentially expressed after treatment with
acetate, and the third column shows the behaviour of those genes in the
presence of ACSS2i. A total of 2,107 of the 3,613 genes that were induced by 
acetate were not upregulated in the presence of ACSS2i (n=4 per group). b, GO 
analysis of genes that are both sensitive to acetate and directly bound by ACSS2. 
Data are from ACSS2 ChIP-seq (n= 429 genes). Genes were assessed using a 
modified Fisher’s exact test (EASE) with the FDR corrected by the Yekutieli 
procedure; −log10 transformations of nominal P values are shown. c, HOMER
(hypergeometric optimization of motif enrichment) unsupervised analysis of 
hippocampal ACSS2-binding sites in the hippocampus, targeting acetate- 
sensitive genes. De novo motif analysis of 751 ACSS2 peaks; a hypergeometric 
test was used for each motif to compare to the background set of ACSS2 peaks 
that do not target acetate-sensitive genes. d, Overlap of genes that are 
upregulated by treatment with ethanol in vivo (dHPC) and acetate in vitro
(n=830).P=3.48 x10?” (hypergeometric test of gene-set overlap). e, f, ACSS2 
target genes with alcohol-induced H3K9ac (e) and H3K27ac (f) in vivo are 
upregulated by acetate in hippocampal neurons in vitro. ACSS2i blocks this gene 
induction. n = 285 genes (e) and n = 362 genes (f), tested against an equal number
of control genes using a two-sided Mann-Whitney rank-sum test; P=0.0077 (e), 
P=0.0013 (f). Gene expression is given in reads per kilobase of transcript per 
million mapped reads (RPKM). Box plots as in Fig. 2. 
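The RPKM unit defined at the end of this caption corresponds to a simple normalization of raw read counts. The sketch below applies the standard definition; the function name `rpkm` and the example numbers are hypothetical, and the study's own quantification pipeline may differ in detail.

```python
def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    kilobases = transcript_length_bp / 1_000
    millions_mapped = total_mapped_reads / 1_000_000
    return read_count / (kilobases * millions_mapped)


# Hypothetical gene: 500 reads on a 2,000-bp transcript in a library
# of 20 million mapped reads (illustration only).
value = rpkm(500, 2_000, 20_000_000)
print(value)
```

Dividing by transcript length and library size makes values comparable across genes of different lengths and samples of different sequencing depth.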


Together, these findings suggest that ACSS2 may have a role in alcohol-related
learning by coordinating alcohol-induced histone acetylation
and gene expression. To examine potential behavioural effects in wild- 
type and ACSS2-knockdown mice, we performed ethanol-mediated 
conditioned place preference (CPP), which has previously been used to 
assess ethanol-associated learning”. In this paradigm, mice are exposed 
to neutral and rewarding stimuli in distinct spatial compartments, dis- 
tinguished by environmental cues. After conditioning, CPP is measured 
by allowing the mice free access to either compartment and measuring 
the time spent in the chamber that is associated with rewarding stimuli 
(Fig. 4a). To assess place-preference learning, we calculated the mean 
time spent in the conditioned (paired with alcohol) and unconditioned 
(paired with saline) chambers (Extended Data Fig. 9c), as well as a CPP 
score, which we defined as the difference between the time spent in the
conditioned and the unconditioned chamber (Fig. 4b). We found that 
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Fig. 4 | ACSS2 is required for alcohol-induced associative learning. 

a, Schematic of ethanol-induced CPP. b, Preference scores for the ethanol-paired 
chamber in wild-type mice (n= 8; P= 0.0391) and mice with knockdown of ACSS2 
in the dHPC (n = 10; P = 0.4316). Data are mean ± s.e.m.; P values by Wilcoxon
matched-pairs signed-rank test. c, Model. Acetate from the breakdown of 
alcohol in the liver is activated by neuronal ACSS2 in the brain and induces gene- 
regulatory acetylation of histones. HAT, histone acetyltransferase. 


wild-type mice spent longer in the compartment to which ethanol was 
delivered during training (P= 0.0391; Wilcoxon test) (Fig. 4b). Notably, 
acquisition of CPP depends on spatial memory formation in the dHPC 
and, accordingly, dHPC lesions disrupt place conditioning”. To test the 
importance of ACSS2 in the dHPC, we injected mice with AAV9 virus 
expressing shRNA against Acss2 to reduce the protein level of ACSS2 
(n=10), or control virus expressing GFP only (n= 8) (Extended Data 
Fig. 9a, b). We detected a significant main effect of the conditioning 
subgroup (P = 0.0227, F1,32 = 5.731; main effect of training from a two-way
analysis of variance (ANOVA) across the four groups), showing that the
ethanol-induced CPP procedure was successful. In addition, we observed
a significant interaction between the ACSS2-knockdown and conditioning
subgroups (P = 0.0462, F1,32 = 4.303; interaction from two-way ANOVA
across the four groups), indicating that the knockdown of ACSS2 in 
the dHPC significantly reduced the expression of CPP. Strikingly, we 
found that ethanol-associated CPP was suppressed in mice in which 
ACSS2 expression was reduced in the dHPC (P= 0.4316; Wilcoxon test) 
(Fig. 4b), indicating that the formation of ethanol-related associative 
memories requires ACSS2. 
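The CPP score used in this analysis is defined in the text as the time spent in the conditioned chamber minus the time spent in the unconditioned chamber, summarized as mean ± s.e.m. A minimal sketch of that calculation follows; the chamber times are hypothetical and the function names are illustrations, not the study's code.

```python
from math import sqrt
from statistics import mean, stdev


def cpp_scores(conditioned_s, unconditioned_s):
    """Per-animal CPP score: conditioned minus unconditioned chamber time."""
    return [c - u for c, u in zip(conditioned_s, unconditioned_s)]


def mean_sem(values):
    """Mean and standard error of the mean (sample s.d. / sqrt(n))."""
    return mean(values), stdev(values) / sqrt(len(values))


# Hypothetical chamber times in seconds for four animals (illustration only).
conditioned = [420.0, 390.0, 455.0, 380.0]
unconditioned = [300.0, 350.0, 310.0, 360.0]
scores = cpp_scores(conditioned, unconditioned)
avg, sem = mean_sem(scores)
print(scores, round(avg, 2), round(sem, 2))
```

A positive mean score indicates preference for the alcohol-paired chamber; a score near zero indicates no learned preference.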

Overall, our ex vivo and in vivo molecular data, together with our 
behavioural findings, show that ACSS2 is required for the incorporation 
of heavy-labelled acetate into acetylated histones in the dHPC, and that 
this facilitates memory-related gene expression and alcohol-related 
associative learning (Fig. 4c). These results establish ACSS2 as a prom- 
ising candidate for therapeutic intervention in alcohol-use disorders, 
in which the memory of alcohol-associated environmental cues is a 
primary driver of craving and relapse. 

Alcohol exposure not only disrupts epigenetic and transcriptional 
processes in the adult brain, but is also linked to epigenetic dysregu- 
lation in the gestating fetus~. In utero, alcohol is an environmental 
teratogen that affects the expression of neurodevelopmental genes 
and can elicit numerous alcohol-associated postnatal disease pheno- 
types, which together are categorized as FASD”. Previous investiga- 
tions of alcohol-mediated epigenetic changes in utero have implicated 
altered histone acetylation in FASD™, but the underlying mechanisms 
are unknown. 
d, Metabolized ethanol-d6 is incorporated into histone acetylation in the
maternal brain. e, Incorporation of ethanol-d6 into histone acetylation in the
fetal brain. Data represent the first of two pools of embryos (n = 4 per pool) from
maternal ethanol-d6 injection. The axes of the Arachne plots represent the
percentage of the third isotope for the acetylated peptide, corresponding to the
d3-labelled form.


We investigated whether alcohol affects dynamic histone acetyla- 
tion in utero in the developing fetal midbrain and forebrain of mice at 
embryonic day (E)18.5. Mass spectrometry of fetal brain showed that 
‘binge-drinking-like’ alcohol exposure, in parallel with maternal labelling 
of neuronal histone acetylation, resulted in the deposition of alcohol- 
derived acetyl groups onto histones in the fetal forebrain and midbrain 
in early neural development (Fig. 4e, Extended Data Fig. 9d), indicating 
an unanticipated potential mechanism for the aetiology of FASD. 

In the adult brain, epigenetic mechanisms that control gene expression
have a key role in processing neural activity to continuously adapt
circuit connectivity and behaviour. Here, we show that exposure to
alcohol gives rise to the acetylation of histones in the brain both directly 
(through the direct incorporation of alcohol-derived acetate) and indi- 
rectly (through other metabolic pathways). Incorporation of alcohol- 
derived acetate into histone acetylation was recently observed in the 
liver. However, to our knowledge, our data provide the first empirical
evidence indicating that a portion of acetate that is derived from the 
metabolism of alcohol directly influences epigenetic regulation in the 
brain. We show that this direct pathway has important functional and 
behavioural consequences, shedding light on a neurobiological aspect 
of alcohol use. Given that effects of ethanol on the brain and behaviour 
are complex, further studies will be required to determine the relative 
contributions of ethanol-derived histone acetylation, ethanol-induced 
intracellular signalling pathways and ethanol-related redox stress. We 
also show that histone acetylation in the brain occurs through the generation
of acetyl-CoA by ACSS2. In the hippocampus, the incorporation of
acetyl groups that are derived from alcohol may be critical for alcohol- 
related associative learning, which encodes environmental cues associ- 
ated with alcohol that drive craving, seeking and consumption even after 
protracted periods of abstinence. The direct pathway that we identify 
here substantially furthers our understanding of alcohol-induced epi- 
genetic regulation in the brain, which has previously been limited to the 
indirect effects of alcohol-induced intracellular signalling and changes 
in the expression or activity of histone-modifying enzymes. The direct 
pathway contributes to a large proportion of the histone acetylation that 
occurs after ethanol exposure, and suggests that the incorporation of
alcohol-derived acetyl groups is physiologically relevant and associated
with the transcriptional and behavioural adaptations that are induced 
by ethanol. Notably, our findings suggest that other peripheral sources 
of physiological acetate—primarily the gut microbiome—may affect 
central histone acetylation and brain function in a similar manner, which
may either control or foster other metabolic syndromes. Translational 
treatment strategies that target this nexus between peripheral metabolic 
activity and neuroepigenetic regulation may pave the way for therapeutic 
interventions for alcohol use and other neuropsychiatric disorders. 
Methods


Data reporting 

No statistical methods were used to predetermine sample size. All ani- 
mals were randomly allocated into experimental and control groups. 
Mass spectrometry analysis of heavy labelled samples and analysis of 
behavioural experiments was performed by blinded investigators. 


Stable isotope labelling of brain histone acetylation 

We injected mice intraperitoneally with 2 g kg⁻¹ ethanol-d6 or control
saline, and assessed deuterium incorporation into acetylated histones
at baseline, as well as at 1 and 4 h after intraperitoneal injections
(Fig. 1a). We confirmed that the injected ethanol-d6 is readily metabolized
to acetate that becomes systemically accessible (Extended Data
Fig. 1a), resulting in rapid labelling of blood acetate (Extended Data
Fig. 1b). Using quantitative liquid chromatography-mass spectrometry
(LC-MS), we quantified the relative abundance of isotopically labelled
histone acetylation in the brain and in peripheral tissues (Fig. 1a).
Analysis of the patterns of heavy labelling showed that injection of
heavy-labelled ethanol leads to marked increases of the M+3 species
(Extended Data Fig. 1e, f, Supplementary Table 1), which suggests that
ethanol-derived acetate contributes to histone acetylation through
direct deposition of triply deuterated acetyl-CoA. This increase in the
M+3 species was also evident when accounting for the natural levels
of heavy isotopes of carbon and hydrogen (Extended Data Fig. 1g, h;
see also Methods for detailed descriptions). The increases of M+1 and
M+2 species (Extended Data Fig. 1h) could be driven by deuteron back
exchange or by singly and doubly deuterated metabolites, indicating
that alternative metabolic pathways also contribute to histone
acetylation labelling; however, the major increase is to the M+3 species
through triply deuterated acetate. We also performed tracing
of ¹³C-labelled alcohol and found rapid incorporation into histone
acetylation in the hippocampus, equivalent to ethanol-d6 (Extended
Data Fig. 2c). We examined whether alcohol exposure during gestation
influences histone acetylation in the developing fetal brain by
measuring the direct deposition of alcohol-derived acetyl groups onto
histones. Using the protocol described above (intraperitoneal injection
of heavy-labelled alcohol (2 g kg⁻¹) followed by mass spectrometry
on isolated histone proteins) we confirmed incorporation of alcohol
metabolites (4 h after injection) into the acetylation of histones in
neurons of gestating female mice (Fig. 4d), consistent with the previous
results in males (Fig. 1b).


Histone extraction 

Histones were extracted as previously described. The cells were incubated
in nuclear isolation buffer (15 mM Tris-HCl, 15 mM NaCl, 60 mM
KCl, 5 mM MgCl₂, 1 mM CaCl₂ and 250 mM sucrose at pH 7.5; 0.5 mM
AEBSF, 10 mM sodium butyrate, 5 nM microcystein and 1 mM DTT added
fresh) with 0.2% NP-40 on ice for 5 min. The nuclei were collected by
centrifuging at 700g at 4 °C for 5 min. The resulting nuclear pellet
was washed twice with the same volume of nuclear isolation buffer
without NP-40. Histones were then acid-extracted with 0.2 M H₂SO₄
for 3 h at 4 °C with rotation. The insoluble nuclear debris was pelleted
at 3,400g at 4 °C for 5 min, and the supernatant was retained. Next,
histone proteins were precipitated by adding 100% trichloroacetic acid
in a 1:3 ratio (v/v) for 1 h at 4 °C. The pellet was washed with acetone to
remove residual acid. Histones were resuspended in 30 µl of 50 mM
NH₄HCO₃ (pH 8.0).


Histone propionylation and digestion 

Histones were derivatized and digested as previously described. The
sample was mixed with 15 µl derivatization mix, consisting of propionic
anhydride and acetonitrile in a 1:3 ratio (v/v), and this was immediately
followed by the addition of 7.5 µl ammonium hydroxide to maintain
pH 8.0. The sample was incubated for 15 min at 37 °C, dried and the
derivatization procedure was repeated one more time. Samples were
then resuspended in 50 mM NH₄HCO₃ and incubated with trypsin
(enzyme:sample ratio of 1:20) overnight at room temperature. After
digestion, the derivatization reaction was performed again twice to
derivatize the N termini of the peptides. Samples were desalted using
C18 stage tips before LC-MS analysis.


Nanoscale liquid chromatography-tandem mass spectrometry 
Samples were analysed by using a nanoscale liquid chromatography-tandem
mass spectrometry (LC-MS/MS) setup. Nanoscale liquid
chromatography was configured with a 75 µm ID × 25 cm Reprosil-Pur
C18-AQ nano-column (3 µm; Dr. Maisch HPLC GmbH) using an EASY-nLC
nano-flow high-performance liquid chromatography (HPLC)
system (Thermo Fisher Scientific), packed in-house. The HPLC gradient
was as follows: 5% to 32% solvent B in solvent A (A = 0.1% formic
acid; B = 80% acetonitrile, 0.1% formic acid) over 45 min, from 32%
to 90% solvent B in 5 min and 90% B for 10 min at a flow rate of 300
nl min⁻¹. Nanoscale liquid chromatography was coupled to an Orbitrap
Elite mass spectrometer (Thermo Fisher Scientific). The acquisition
method was data-independent acquisition (DIA) as described. In
brief, two full-scan mass spectra (m/z 300-1,100) were acquired in
the ion trap within a DIA duty cycle, and 16 MS/MS were performed
with an isolation window of 50 Da. Normalized collision energy was
set to 35%.
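
The gradient described above can be read as a piecewise-linear function of run time. The following Python sketch is illustrative only: the function name `percent_solvent_b` and the decision to reject times outside the 60-min programme are assumptions made here, not part of the acquisition method.

```python
def percent_solvent_b(minutes: float) -> float:
    """Return the percentage of solvent B at a given time in the 60-min gradient."""
    # Breakpoints (time in min, % solvent B) taken from the text:
    # 5% -> 32% over 45 min, 32% -> 90% over 5 min, then 90% held for 10 min.
    points = [(0.0, 5.0), (45.0, 32.0), (50.0, 90.0), (60.0, 90.0)]
    if minutes < 0.0 or minutes > 60.0:
        # Assumed behaviour: times outside the programme are rejected.
        raise ValueError("time outside the 0-60 min gradient")
    for (t0, b0), (t1, b1) in zip(points, points[1:]):
        if minutes <= t1:
            # Linear interpolation within the current segment.
            return b0 + (b1 - b0) * (minutes - t0) / (t1 - t0)
    return points[-1][1]
```

For example, `percent_solvent_b(47.5)` falls halfway through the steep 32% to 90% ramp.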


Data analysis 

Raw mass spectrometry data were analysed manually. We selected the 
seven most-intense peptides of histones H3 and H4 that contained acety- 
lations, and extracted the relative abundance of the M+1, M+2 and M+3 
signals compared to the monoisotopic signal. The other peptides were 
not considered as, owing to their low abundance, we could not reliably 
quantify the relative abundance of all the isotopes. The percentage 
represented in the radar plots indicates the relative intensity of the M+3 
signal (the fourth isotope) as compared to the monoisotopic signal. Data 
were not normalized to the non-labelled sample, so that the relative 
abundance of the natural isotopic distribution could also be observed 
in the untreated mice.

For statistical analysis, we performed two-tailed heteroscedastic 
t-tests (significant at P < 0.05) when comparing the same isotope in 
treated versus untreated samples (Supplementary Table 1). In this 
analysis, we considered differences in the relative abundance of all 
species (M+1, M+2, M+3), and found that major changes occur in M+3 
(Supplementary Tables 1, 2). We used the R package enviPat to estimate
the theoretical relative abundance of the first four isotopes of
the seven peptides considered in this study, which showed no significant
difference to the observed isotopic distribution of the untreated
samples (Extended Data Fig. 1g). Natural abundance corrections were
performed using FluxFix, to calculate the exact relative abundance
of the M+1, M+2 and M+3 species in the samples that were treated 
with labelled ethanol or acetate (Extended Data Fig. 1h). We used a 
matrix-based approach to correct natural abundance, as proposed 
previously. This calculation corrects for the contribution of each 
isotopologue, removing, for example, from the M+3 signal, the por- 
tion of the isotopic pattern that is contributed by the increase of the 
M+1 and M+2 species. 
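
The idea behind a matrix-based natural abundance correction can be sketched as follows. This is a simplified illustration, not the FluxFix implementation: it assumes that every labelled species carries the same natural isotope pattern as the unlabelled peptide, shifted by its number of labels, and all function names and numerical values are hypothetical. Under that assumption the measured signal is a lower-triangular matrix product of the true labelling fractions, which can be inverted by forward substitution.

```python
def correction_matrix(natural, size):
    """Build a lower-triangular matrix whose column j is the natural
    isotope pattern shifted down by j mass units (assumed model)."""
    return [
        [natural[i - j] if 0 <= i - j < len(natural) else 0.0 for j in range(size)]
        for i in range(size)
    ]


def simulate_measured(true_fractions, natural):
    """Forward model: measured M+i signal given the true labelled fractions."""
    size = len(true_fractions)
    matrix = correction_matrix(natural, size)
    return [
        sum(matrix[i][j] * true_fractions[j] for j in range(size))
        for i in range(size)
    ]


def correct_natural_abundance(measured, natural):
    """Solve matrix * corrected = measured by forward substitution."""
    size = len(measured)
    matrix = correction_matrix(natural, size)
    corrected = []
    for i in range(size):
        # Remove the natural-isotope spillover from lighter species.
        spillover = sum(matrix[i][j] * corrected[j] for j in range(i))
        corrected.append((measured[i] - spillover) / matrix[i][i])
    return corrected


# Hypothetical numbers: natural pattern of an unlabelled peptide (M+0..M+3)
# and a sample in which 30% of the peptide carries a d3-acetyl group.
natural_pattern = [0.80, 0.15, 0.04, 0.01]
measured_signal = simulate_measured([0.7, 0.0, 0.0, 0.3], natural_pattern)
labelled = correct_natural_abundance(measured_signal, natural_pattern)
```

In this toy case the raw M+3 signal contains a small contribution from the natural M+3 isotopologue of the unlabelled peptide, and the correction removes it so that only the label-derived fraction remains.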


ChIP-seq 

ChIP-seq was performed as previously described with modifications
for the preparation of mouse brains. In brief, approximately 20 mg
of dHPC from each mouse was minced on ice, cross-linked with 1% formaldehyde
for 10 min and quenched with 125 mM glycine for 5 min.
Nuclei were prepared by dounce homogenization of cross-linked
tissue in nuclei isolation buffer (50 mM Tris-HCl at pH 7.5, 25 mM KCl,
5 mM MgCl₂, 0.25 M sucrose) with freshly added protease inhibitors
and sodium butyrate. Nuclei were lysed in nuclei lysis buffer (10 mM
Tris-HCl at pH 8.0, 100 mM NaCl, 1 mM EDTA, 0.5 mM EGTA, 0.1% sodium
deoxycholate, 0.5% N-lauroylsarcosine) with freshly added protease
inhibitors and sodium butyrate, and chromatin was sheared using
a Covaris S220 sonicator to approximately 250 bp in size. Equal aliquots
of sonicated chromatin were used per immunoprecipitation
reaction with 5 µl H3K9ac antibody (Active Motif; 39137; 09811002) or
4 µl H3K27ac antibody (Abcam; 4729, GR323132-1) preconjugated to
Protein G Dynabeads (Life Technologies). Ten percent of the chromatin
was saved as input DNA. ChIP reactions were incubated overnight at
4 °C with rotation and washed three times in wash buffer. Immunoprecipitated
DNA was eluted from the beads, reverse cross-linked
and purified together with the input DNA. Exactly 10 ng DNA (either
ChIP or input) was used to construct sequencing libraries using the
NEBNext Ultra II DNA library preparation kit for Illumina (New England
Biolabs; NEB). Libraries were multiplexed using NEBNext Multiplex
Oligos for Illumina (dual index primers) and single-end sequenced
(75 bp) on the NextSeq 500 platform (Illumina) in accordance with the
manufacturer’s protocol.


ChIP-seq analysis 

ChIP-seq tags generated with the NextSeq 500 platform were demul- 
tiplexed with the bcl2fastq utility and aligned to the mouse reference 
genome (assembly GRCm38 (mm10)) using Bowtie v.1.1.1, allowing 
up to two mismatches per sequencing tag (parameters -m 1 --best).
Peaks were detected using MACS2 (tag size = 75 bp; FDR <1 x 10°) 
from pooled H3K9ac or H3K27ac tags of mice from the same condi- 
tion along with treatment-matched input tags as control. The multiple 
transcription-factor-binding loci (MTL) method was used to compare
H3K9ac or H3K27ac enrichment in the four study conditions.
The statistical significance of differential H3K9ac or H3K27ac enrich- 
ments was assessed by using DiffBind (Bioconductor v.3.7) in a two- 
way comparison (wild-type mice treated with saline versus wild-type 
mice treated with ethanol, or ACSS2-knockdown mice treated with 
saline versus ACSS2-knockdown mice treated with ethanol) across 
the individual replicate samples (FDR < 10%). UCSC genome-browser 
track views were created for ChIP-seq data by first pooling replicates 
and generating coverage maps using BEDtools genomeCoverageBed 
-bg, then adjusting for library size using the RPM (reads per million) 
coefficient. The input signal was then subtracted from the ChIP signal. 
The resulting tracks were converted from bedGraph to bigWig files 
using the bedGraphToBigWig function in the UCSC genome browser. 
RNA-seq tracks were created similarly, first splitting by tag orientation 
to the genomic reference strand and then creating coverage maps. 
Because an RPM adjustment might disguise a large deformation in the
transcriptome distribution, maps were adjusted for library size using 
the average scalar coefficient size factor determined by DESeq2. The 
resulting tracks were converted to bigWig files, as for ChIP-seq tracks, 
and the + and — tags from a given sample were plotted as overlays in 
a track hub. 
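
The library-size adjustment and input subtraction used for the ChIP-seq tracks amount to simple arithmetic on per-position coverage. The sketch below shows only that arithmetic on plain Python lists; it does not reproduce the BEDtools or UCSC commands, and the function names and example counts are hypothetical.

```python
def rpm_scale(coverage, total_reads):
    """Scale raw coverage to reads per million (RPM) mapped reads."""
    factor = 1_000_000 / total_reads
    return [count * factor for count in coverage]


def input_subtracted_track(chip_cov, chip_reads, input_cov, input_reads):
    """RPM-scale the ChIP and input coverage, then subtract input from ChIP.

    Negative values are kept as they are; whether to floor them at zero
    is not stated in the text, so no clipping is assumed here.
    """
    chip_rpm = rpm_scale(chip_cov, chip_reads)
    input_rpm = rpm_scale(input_cov, input_reads)
    return [c - i for c, i in zip(chip_rpm, input_rpm)]


# Hypothetical coverage over three bins for one pooled condition.
track = input_subtracted_track(
    chip_cov=[10, 20, 0], chip_reads=2_000_000,
    input_cov=[4, 4, 4], input_reads=4_000_000,
)
```

Scaling both libraries before subtraction keeps the comparison independent of sequencing depth, which is why the adjustment precedes the subtraction step.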


RNA-seq 

All RNA-seq data were prepared for analysis as follows. NextSeq sequenc- 
ing data were demultiplexed using native applications on BaseSpace. 
Demultiplexed FASTQs were aligned by RNA-STAR v.2.5.2 to the assembly 
mm10 (GRCm38). Aligned reads were mapped to genomic features using 
HTSeq v.0.6.1. Quantification, library size adjustment and analysis of 
differential gene expression was done using DESeq2 and Wald’s test 
(pairwise comparisons between treatment with acetate and DMSO in 
the inhibitor-treated or untreated cells, followed by a set overlap of 
differentially expressed genes). Overlaps between lists of genes were 
tested for significance using a hypergeometric test. 
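
A hypergeometric test for the overlap between two gene lists can be sketched in a few lines. The text does not state which tail or software was used, so this example assumes a one-sided upper-tail test (the probability of observing at least the given overlap by chance); the function name and the numbers in the usage line are hypothetical.

```python
from math import comb


def hypergeom_overlap_pvalue(population, list_a, list_b, overlap):
    """P(X >= overlap) when list_b genes are drawn without replacement from
    a population that contains list_a genes of interest."""
    upper = min(list_a, list_b)
    total = comb(population, list_b)
    tail = sum(
        comb(list_a, k) * comb(population - list_a, list_b - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


# Toy example: two lists of 5 genes from a population of 10 that overlap fully.
p_value = hypergeom_overlap_pvalue(population=10, list_a=5, list_b=5, overlap=5)
```

Exact integer arithmetic with `math.comb` avoids rounding error in the binomial coefficients; only the final ratio is converted to floating point.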


Functional analysis 
GO analysis was performed using DAVID with an FDR cut-off of 10%, filtering
categories with fewer than 10 genes or less than 2.5-fold enrichment
over background. Motif analysis was performed using HOMER v.4.6 on
all ACSS2 peaks from published in vivo data, targeting (by the nearest
transcription start site) a gene sensitive to acetate with H3K9ac at the
promoter using a fixed window of 300 bp.


Primary hippocampal neurons 

Plated primary hippocampal neurons were obtained from the Neurons R
Us neuronal core at the University of Pennsylvania. Cells were maintained
in neurobasal medium (NBM; Gibco) supplemented with GlutaMAX
(Gibco) and B27 (Gibco). After 7 days of differentiation, cells were treated
for 24 h with 5 mM acetate or vehicle (NBM) in the presence or absence
of 20 µM ACSS2i or vehicle (DMSO diluted into NBM). Cells were lysed
using QIAzol (Qiagen) and RNA was extracted using the RNeasy Mini 
Kit (Qiagen). RNA-seq libraries were prepared using the NEBNext Ultra 
II Directional RNA library preparation kit (NEB). 


Mouse experiments 

Animal use, surgical procedures and all experiments performed were 
approved by the Institutional Animal Care and Use Committee (protocol 
804849). All personnel involved have been adequately trained and are 
qualified according to the Animal Welfare Act and the Public Health 
Service policy. Adult male mice (8 weeks old) or E18.5 pregnant females 
were used. Ethanol, ethanol-d6 (Sigma-Aldrich; 186414), ethanol-1-¹³C
(Sigma-Aldrich; 324523) and sodium acetate-d3 (Sigma-Aldrich) were
administered by intraperitoneal injection and dosed at 2 g kg⁻¹ (20% solution
in saline, filtered through a Stericup GV filter). CPP was performed
as described previously. In brief, mouse CPP boxes (Ugo Basile; 42553)
with external dimensions 35 × 18 × 29 cm were used. The apparatus was
divided into two chambers (16 × 15 × 25 cm) that differed in wall and
floor pattern. Striped walls were paired with circle cutouts (1 cm) and
solid grey walls were paired with square cutouts (0.5 cm). Sessions were
run in a dark room at ambient temperature. Boxes were cleaned with
70% ethanol between mice and allowed to dry between rounds. The 
paradigm consisted of 1 habituation day (5 min exploration in neutral 
environment), 1 pre-training session (20 min with access to both cham- 
bers), 8 training days (biased subject assignment, alternating sessions 
of intraperitoneal injection of saline or ethanol immediately before 
the 5-min session) and 1 post-training test session (20 min with access 
to both chambers). The percentage of time spent in the conditioned 
chamber was measured by blinded investigators. Preference scores 
were calculated as the difference between the time spent in the con- 
ditioned chamber and the unconditioned chamber. A Shapiro-Wilk 
test was used to assess normal distribution and a Mann-Whitney test 
to determine learning. 
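
The two CPP read-outs described above can be computed directly from the chamber times. The following sketch is a minimal illustration; the function names and the example times are hypothetical, and it assumes that the percentage is taken over the time spent in the two chambers only.

```python
def preference_score(conditioned_s, unconditioned_s):
    """Difference between time in the conditioned and unconditioned chamber (s)."""
    return conditioned_s - unconditioned_s


def percent_time_conditioned(conditioned_s, unconditioned_s):
    """Percentage of chamber time spent in the conditioned chamber.

    Assumption: the denominator is the sum of the two chamber times.
    """
    total = conditioned_s + unconditioned_s
    if total <= 0:
        raise ValueError("total chamber time must be positive")
    return 100.0 * conditioned_s / total


# Hypothetical 20-min test session: 700 s conditioned, 500 s unconditioned.
score = preference_score(700.0, 500.0)
percent = percent_time_conditioned(700.0, 500.0)
```

A positive score indicates a preference for the ethanol-paired chamber, and a score near zero indicates no preference.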


Intracranial injection of viral vector 

Adult male C57BL/6J mice of 10 weeks of age were anaesthetized
with isoflurane gas (1-5% to maintain surgical plane) and placed in a
sterile field within a stereotaxic device. Artificial tears were applied
to eyes to ensure sufficient lubrication. Mice received an injection
of bupivacaine (2.5 mg kg⁻¹) for local anaesthesia before the skin
was disinfected with betadine solution and the skull exposed with a
short incision using sterile surgical equipment. A small hole (about
0.5 mm) was drilled in the skull over the target area using a stereotax
and a stereotactic drill. A micro-syringe filled with viral vector
was inserted into the dHPC and slowly removed following injection
(AP, -2.0 mm; DV, -1.4 mm; ML, + 1.5 mm from bregma). The vector
for ACSS2 knockdown was AAV2/9.U6.shACSS2.CMV.EGFP. All mice
received a single dose of subcutaneous meloxicam (5 mg kg⁻¹) as
analgesia at induction and one dose per day for two days postoperatively
as needed.


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
Data availability


All RNA-seq and ChIP-seq data are available at the Gene Expression 
Omnibus (GEO) with accession number GSE122188. Raw mass spec- 
trometry data are provided as Supplementary Table 2. 
Extended Data Fig. 1 | Ethanol-derived acetyl groups are rapidly
incorporated into histone acetylation in the brain. a, Mass spectrometry
analysis of serum acetate shows the rapid increase in levels of acetate in mice
that were injected with alcohol, at 30 min after injection (n = 3 for saline, n = 4 for
acetate group). Data are mean ± s.e.m.; P = 0.0258 (two-tailed unpaired t-test).
b, Ethanol-d6 is readily metabolized and thus labels blood acetate pools.
Acetate-d3 was detected by mass spectrometry (n = 4 per group). Data are
mean ± s.e.m.; P = 0.0016 (two-tailed unpaired t-test). c, Incorporation of the
ethanol-d6 label into histone acetylation in the cortex shows a similar pattern to
the hippocampus. The axis of the Arachne plot represents the percentage of the
third isotope for the acetylated peptide, corresponding to the d3-labelled form;
the natural relative abundance of that isotope is apparent in the ‘none’ and
‘saline 1h’ treatment groups. d, Histone acetylation is relatively independent of
alcohol metabolism in skeletal muscle, a tissue in which the expression of ACSS2
is low. e, f, Mass spectra (representative examples from three biological
replicates) showing the relative abundance of deuterated histone H4-triacetyl
peptide (amino acids 4-17) in the hippocampus of wild-type mice at baseline and
4 h after injection of ethanol-d6. Increases of the M+1 (blue lines), M+2 (green
lines) and M+3 (red lines) species are shown in e and indicate a major increase of
M+3. The contribution of singly (orange), doubly (grey) and triply (yellow)
deuterated peptides to the isotopic distribution is shown in f. The relative
abundance of the M+3 species is increased by about sixfold at 4 h after injection
of ethanol-d6, and is overwhelmingly due to the triply deuterated peptides; by
contrast, the contribution of singly and doubly deuterated peptides to the M+3
species is minimal. The experiment was performed with three biological
replicates per group. g, The relative abundance of the first four isotopes of each
of the seven peptides in the untreated samples corresponds to the theoretical
isotopic distribution of the peptides (calculated using enviPat; samples not
treated with ethanol-d6; n = 20). Data are mean ± s.d. h, Natural abundance-corrected
contribution of M+1, M+2 and M+3 species to the labelling of histone
acetylation in the liver and hippocampus after intraperitoneal injection of
ethanol-d6 (calculated using FluxFix; n = 3 per group). Data are mean ± s.d.
Extended Data Fig. 2 | Dynamics of ethanol- and acetate-induced heavy-label
incorporation. a, b, Relative abundance of deuterated histone acetylation
in dHPC, vHPC, cortex, liver and muscle at 8 h (a) and 24 h (b) after
intraperitoneal injection of ethanol-d6. c, ¹³C-labelled ethanol, introduced by
intraperitoneal injection, readily labels histone acetylation in the hippocampus
(percentage increase over natural abundance of ¹³C acetyl groups in saline-injected
mice; n = 1). d, In contrast to heavy ethanol-d6, a non-labelled ethanol
control does not increase the natural abundance of heavy histone acetylation in
the hippocampus. e, Histone acetylation is relatively independent of alcohol
metabolism in skeletal muscle. Relative abundance of deuterated histone
acetylation in skeletal muscle tissue at 30 min and 4 h in wild-type mice, and
30 min in dHPC ACSS2-knockdown mice. f, g, Heavy acetate, introduced by
intraperitoneal injection, readily labels histone acetylation in the dHPC (f) and
the cortex (g) (n = 2 at 30 min; n = 3 per group at other time points). Data are
mean ± s.e.m. h, Levels of acetate measured by mass spectrometry in
hippocampal tissue after injections of acetate and ethanol (n = 3 per group).
Data are mean ± s.e.m.; P = 0.0335 for acetate versus saline; P = 0.0285 for
ethanol versus saline (two-tailed unpaired t-test).
Extended Data Fig. 3 | Metabolite labelling in hippocampal tissue 30 min
after intraperitoneal injection of ethanol-d6. a-f, Mass spectrometry
quantification of metabolite labelling in hippocampal tissue at 30 min after
intraperitoneal injection of ethanol-d6. The ethanol-d6 label was incorporated
into the acetate pool in the hippocampus (a). By contrast, ethanol-d6 did not
contribute to the glucose (b) or 3-hydroxybutyrate (d) pools in the
hippocampus, and only minimally to the lactate pool (c). Labelling was observed
in the hippocampal glutamine (e) and citrate or isocitrate (f) pools.


[Figure panels: H3K27ac and H3K9ac ChIP-seq tracks; rows labelled WT untreated, WT EtOH, KD untreated and KD EtOH]


Extended Data Fig. 4 | Representative H3K9ac and H3K27ac dHPC ChIP-seq 
tracks in control and ethanol-treated wild-type and ACSS2-knockdown
mice. a-c, ChIP-seq for H3K9ac and H3K27ac in untreated and ethanol-treated 
wild-type and ACSS2-knockdown mice. Genome-browser track views show the biological replicates per group. 
Extended Data Fig. 5 | Epigenetic and transcriptional changes in the dHPC of
control and ethanol-treated wild-type and ACSS2-knockdown mice. a-d,
Decile plots of genes that are enriched in H3K9ac (a) and H3K27ac (b) show
correlation with mRNA expression levels in hippocampus, in wild-type mice 1 h
after injection with ethanol. By contrast, in ACSS2-knockdown mice, the
correlation between histone H3K9 acetylation (c) and H3K27 acetylation (d) and
alcohol-related mRNA expression is largely lost (n = 16,553 genes (population)
arranged into ten equal-sized deciles by ChIP-seq enrichment of acetylation).
Box plots as in Fig. 2. e, f, GO analysis on H3K9ac (e) and H3K27ac (f) peaks that
are induced by ethanol in wild-type but not ACSS2-knockdown mice (n = 332
H3K9ac peaks and n = 480 H3K27ac peaks). GO enrichment analysis was
performed using a modified Fisher’s exact test (EASE) with the FDR corrected by
the Yekutieli procedure; −log₁₀ transformations of nominal P values are shown.
Extended Data Fig. 6 | Transcriptional changes in primary hippocampal
neurons that were treated with supraphysiological levels of acetate.
a, Structure of ACSS2i (C₂₀H₁₈N₄O₂S₂). b, RNA-seq showing differentially
regulated genes in primary hippocampal neurons that were treated with 5 mM
acetate (n = 4 replicates per group; volcano plot of likelihood ratio test (two-sided)
in DESeq2; FDR controlled for multiple hypothesis testing). c, d, GO
analysis of genes that are significantly upregulated (c; n = 3,613 genes) and
significantly downregulated (d; n = 3,987 genes) after treatment with 5 mM
acetate. GO analysis was performed with GOrilla, using a minimal
hypergeometric test. e, RNA-seq in primary hippocampal neurons that were
isolated from C57/Bl6 mouse embryos and treated with acetate (5 mM) in the
presence or absence of ACSS2i. Of the 3,613 acetate-induced genes, 2,107 are not
upregulated in the presence of ACSS2i (n = 3,613 induced genes (population) or
3,613 randomly sampled genes (population); P < 2.2 × 10⁻¹⁶ (two-sided Mann-Whitney
rank-sum test)). Box plots as in Fig. 2. f, Acetate-induced genes in
primary hippocampal neurons are shown in blue (n = 3,613). Of these genes,
2,107 (non-overlapping with orange) were sensitive to ACSS2i, and 1,506 were
also induced in the presence of ACSS2i (overlapping with orange). GO
enrichment analysis was performed using a modified Fisher’s exact test (EASE)
with the FDR corrected by the Yekutieli procedure; −log₁₀ transformations of
nominal P values are shown.
Extended Data Fig. 7 | Representative RNA-seq tracks in control and acetate- diminished induction after treatment with ACSS2i (n = 4 per cohort). RNA-seq
Extended Data Fig. 8 | Transcriptional changes that are induced by acetate or proximal to the gene promoter. b, GO analysis for the 830 overlapping genes
changes in gene expression that are induced by ethanol. a, Cumulative genes (population)). GO enrichment analysis was performed using a modified
Extended Data Fig. 9 | Behavioural importance of ACSS2 expression in the
dHPC and heavy-label incorporation in the fetal brain. a, Representative image
showing virus localization to the dHPC, and western blot (n = 4 mice) showing ACSS2
levels in the dHPC of wild-type and ACSS2-knockdown mice (for gel source data, see
Supplementary Fig. 1). b, Quantification of the levels of ACSS2 protein in the dHPC
and cortex of wild-type and dHPC ACSS2-knockdown mice (n = 4 mice). Data are
mean ± s.e.m.; P = 0.0001 and q = 0.0001 for ACSS2-knockdown versus wild-type
mice (dHPC); P = 0.2666 and q = 0.1347 for ACSS2-knockdown versus wild-type mice
(cortex) (multiple t-test). c, ACSS2 is required for alcohol-induced associative
learning. Mean time (seconds per minute) spent in unconditioned and ethanol-conditioned
chambers following ethanol-induced CPP training in wild-type (n = 8)
and dHPC ACSS2-knockdown mice (n = 10). Bar graphs represent mean ± s.e.m. and
data points correspond to individual mice. d, Incorporation of ethanol-d6 into
histone acetylation in the fetal brain. Data represent the second of two pools of
embryos (n = 4 per pool) from maternal ethanol-d6 injection. The axes of the
Arachne plots represent the percentage of the third isotope for the acetylated
peptide, corresponding to the d3-labelled form.
Statistical parameters


When statistical analyses are reported, confirm that the following items are present in the relevant location (e.g. figure legend, table legend, main 
text, or Methods section). 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


An indication of whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistics including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND 
variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 
Life sciences study design


All studies must disclose on these points even when the disclosure is negative. 


Sample size: Sample sizes were selected in accordance with the literature and based on previous experience in our group. The Berger lab has experience
assessing animal behavior with pharmacological or genetic manipulations and in our experience, robust effects are achieved when group sizes
are at least >7-9 animals. Regarding reproducibility, all of our RNA-seq and ChIP-seq datasets were replicated 3-4 times for each condition.
Two replicates of RNA-seq are recommended by ENCODE: https://genome.ucsc.edu/ENCODE/protocols/dataStandards/
ENCODE_RNAseq_Standards_V1.0.pdf.


Data exclusions: Pre-established exclusion criteria: mice with higher than 65% pre-training to either of the compartments in the CPP were excluded from
training. All mice that were trained were included in the analysis (i.e. nothing was excluded after the experiment was run). Grubbs' test was
used to detect statistically significant outliers. One animal was excluded from the in vivo acetate deposition experiment due to failed i.p.
injection (Extended Data Fig. 2f, g, 30 min timepoint). No other data were excluded from the analyses.

Replication: RNA-seq of primary hippocampal neurons was successfully performed on 4 biological replicates. In vivo alcohol and acetate injection in mice
was successfully performed on 2-3 biological replicates per time point. Fetal heavy acetyl group incorporation in mice was successfully tested
on 2 biological replicates (two independent pools of fetal brain tissue).


Randomization: All samples and animals were randomly allocated into experimental and control groups.


Blinding: Mass spectrometry analysis of heavy labeled samples was performed by blinded investigators. All animal testing was scored blindly.


Reporting for specific materials, systems and methods 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
x Unique biological materials xX | ChIP-seq 
x | Antibodies x Flow cytometry 
x Eukaryotic cell lines x MRI-based neuroimaging 
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X | Animals and other organisms 


x Human research participants 


Antibodies 


Antibodies used H3K9ac antibody (Active Motif, cat # 39137; lot # 09811002), H3K27ac antibody (Abcam, cat #4729; lot # GR323132-1) 


Validation ChIP-validated antibodies. Relevant information and citations available on the manufacturers' websites; both antibodies were 
tested by ChIP analysis. Chromatin IP performed using the ChIP-IT® Express Kit (Catalog No. 53008) and HeLa Chromatin (1.5 × 10^6 
cell equivalents per ChIP) using 10 µl of Histone H3 acetyl Lys9 antibody or the equivalent amount of rabbit IgG as a negative 
control. 
Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Male and female C57BL/6J mice of 10 weeks of age were used. 
Wild animals The study did not involve wild animals. 
Field-collected samples The study did not involve samples collected from the field. 


ChIP-seq 


Data deposition 


[x] Confirm that both raw and final processed data have been deposited in a public database such as GEO. 


[x] Confirm that you have deposited or provided access to graph files (e.g. BED files) for the called peaks. 


Data access links GEO SuperSeries GSE122188 [NCBI tracking system #19551484] 
May remain private before publication. Reviewers can access all information using the following secure token: ynupekygbnqnbkl 
Files in database submission all raw sequencing data and processed data discussed in manuscript 
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bp) on the NextSeq 500 platform (Illumina) in accordance with the manufacturer's protocol. ChIP-seq tags generated with 
the NextSeq 500 platform were demultiplexed with the bcl2fastq utility and aligned to the mouse reference genome 
(assembly GRCm38/mm10) using Bowtie v1.1.1 [3] allowing up to two mismatches per sequencing tag (parameters -m 1 --best). 
Antibodies H3K9ac antibody (Active Motif, cat # 39137; lot # 09811002); H3K27ac antibody (Abcam, cat #4729; lot # GR323132-1) 
Peak calling parameters ChIP-seq tags generated with the NextSeq 500 platform were demultiplexed with the bcl2fastq utility and aligned to the 
mouse reference genome (assembly GRCm38/mm10) using Bowtie v1.1.1 [3] allowing up to two mismatches per 
sequencing tag (parameters -m 1 --best). Peaks were detected using MACS2 (tag size = 75 bp; FDR < 1 × 10^-3) from pooled 
H3K9ac or H3K27ac tags of mice from the same condition along with treatment-matched Input tags as control. 
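
The alignment and peak-calling settings listed above can be sketched as command lines. The following is a minimal illustration, not the authors' pipeline: the file names, the index name `mm10` and the output prefix are hypothetical placeholders, and only the flags named in the text (`-m 1 --best`, a 75-bp tag size and an FDR cutoff of 1 × 10^-3) are taken from the summary. The mismatch setting is not spelled out as a flag in the text and is therefore left out here.

```python
from typing import List


def bowtie_command(index: str, fastq: str, sam_out: str) -> List[str]:
    """Build a Bowtie 1 call that discards tags with more than one
    reportable alignment (-m 1), reports the best alignment (--best)
    and writes SAM output (-S)."""
    return ["bowtie", "-m", "1", "--best", "-S", index, fastq, sam_out]


def macs2_command(chip_bam: str, input_bam: str, name: str) -> List[str]:
    """Build a MACS2 call against a matched Input control with a
    75-bp tag size and a q-value (FDR) cutoff of 1e-3."""
    return [
        "macs2", "callpeak",
        "-t", chip_bam,      # pooled ChIP tags for one condition
        "-c", input_bam,     # treatment-matched Input tags
        "-f", "BAM",
        "-g", "mm",          # mouse effective genome size shortcut
        "--tsize", "75",
        "-q", "0.001",
        "-n", name,
    ]


if __name__ == "__main__":
    # Hypothetical file names, shown only to illustrate the argument order.
    print(" ".join(bowtie_command("mm10", "sample.fastq", "sample.sam")))
    print(" ".join(macs2_command("chip.bam", "input.bam", "H3K9ac_pooled")))
```

Building the argument lists separately from running them keeps the parameters easy to audit against the reporting summary before anything is executed.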


Data quality The MTL method was used to compare H3K9ac or H3K27ac enrichment in the four study conditions. Statistical significance 
of differential H3K9ac or H3K27ac enrichments was assessed by using DiffBind (Bioconductor v3.7) in a 2-way comparison 
subtracted from ChIP signal. Resulting tracks were converted from bedGraph to bigWig using the Genome Browser's 
bedGraphToBigWig utility. RNA-seq tracks were created similarly, first splitting by tag orientation to the genomic reference 
strand and then creating coverage maps. Because an RPM adjustment might disguise a large deformation in the 
transcriptome distribution, maps were adjusted for library size using the average scalar coefficient size factor determined by 
DESeq2. Resulting tracks were converted to bigWigs as ChIP-seq tracks were, and the + and - tags from a given sample were 
The enzyme protochlorophyllide oxidoreductase (POR) catalyses a light-dependent 
step in chlorophyll biosynthesis that is essential to photosynthesis and, ultimately, all 
life on Earth. POR, which is one of three known light-dependent enzymes, catalyses 
reduction of the photosensitizer and substrate protochlorophyllide to form the 
pigment chlorophyllide. Despite its biological importance, the structural basis for POR 
photocatalysis has remained unknown. Here we report crystal structures of 
cyanobacterial PORs from Thermosynechococcus elongatus and Synechocystis sp. in 
their free forms, and in complex with the nicotinamide coenzyme. Our structural 
models and simulations of the ternary protochlorophyllide-NADPH-POR complex 
identify multiple interactions in the POR active site that are important for 
protochlorophyllide binding, photosensitization and photochemical conversion to 
chlorophyllide. We demonstrate the importance of active-site architecture and 
protochlorophyllide structure in driving POR photochemistry in experiments using 
POR variants and protochlorophyllide analogues. These studies reveal how the POR 
active site facilitates light-driven reduction of protochlorophyllide by localized 
hydride transfer from NADPH and long-range proton transfer along structurally 
defined proton-transfer pathways. 


As the light-driven step in the chlorophyll biosynthetic pathway (Fig. 1), 
the POR reaction acts as the trigger for the germination of seedlings in 
plants and provokes a marked change in the morphological development 
of the plant. Given this crucial biological role, POR has been the focus of 
numerous mechanistic and biophysical investigations. A combination of 
time-resolved (at the femtosecond-to-second scale) and cryogenic spectroscopy 
methods has provided some understanding of the mechanism 
of POR photocatalysis in a range of photosynthetic organisms, including 
cyanobacteria and plants. Picosecond excited-state dynamics in the 
protochlorophyllide (Pchlide) molecule are thought to result in excited-state 
interactions between the substrate and active-site residues that are 
necessary to trigger the subsequent reaction chemistry. This involves 
the sequential transfer of a hydride equivalent from NADPH and a proton 
transfer from either an active-site residue or solvent. Proton transfer is 
reliant on solvent dynamics and an implied network of extended protein 
motions that occur on the microsecond timescale. Hydride transfer 
from NADPH is not concerted but instead occurs in a stepwise manner 
that involves Pchlide excited-state electron transfer from NADPH followed 
by a proton-coupled electron transfer, which represents the 
first example of the stepwise transfer of a hydride equivalent that has 
been reported in biology. These time-resolved studies have provided 
insight into the chemistry of catalysis across a wide range of timescales 
(from the femtosecond to the second), but the required structural basis 
of POR photocatalysis has remained unknown. This structural context is 
required to understand how substrate binding, excited-state chemistry, 
bond making and/or breaking and the dynamics of photocatalysis are 
controlled by protein structure. 

Here we report the crystal structures of the apo-POR enzyme from 
T. elongatus (RCSB Protein Data Bank (PDB) code 6RNV) and NADPH-bound 
POR from both Synechocystis sp. and T. elongatus (PDB codes 
6G08 and 6RNW, respectively) (Extended Data Table 1a, Supplementary 
Methods), solved at 1.3 Å, 1.9 Å and 1.9 Å resolution, respectively (Fig. 2, 
Extended Data Fig. 1). As a member of the short-chain dehydrogenase 
and reductase (SDR) family of enzymes, the overall structure of POR 
is similar to that of other members of the SDR family and has a typical 
dinucleotide-binding Rossmann fold, which comprises a central 
β-sheet surrounded by six α-helices (Extended Data Figs. 1, 2). Three 
flexible regions of the structure (residues 146-160, 228-255 and 284-291; 
hereafter denoted R1, R2 and R3, respectively) are not observed in 
the T. elongatus electron density. This observation is consistent with 
molecular-dynamics simulations of both the apo- and coenzyme-bound 
structures (Extended Data Fig. 3), which indicates that these regions have 
a high degree of flexibility. Despite this, both the R1 region and the R2 
region are ordered in the coenzyme-bound structure of the Synechocystis 
Fig. 1 | The light-driven reduction of the C-17–C-18 double bond of Pchlide to 
form chlorophyllide. The reaction is catalysed by POR and is a key regulatory 
step within the chlorophyll biosynthetic pathway. Catalysis involves excited- 


POR and therefore implicated in coenzyme binding to POR. A short loop 
(residues 223-229) within the R2 region extends from the central β-sheet, 
covers the nicotinamide moiety of the NADPH and becomes ordered 
upon binding of the coenzyme. A similar loop is observed over the 
coenzyme-binding pocket of other SDR enzymes, and it has previously been 
shown that coenzyme binding causes the extended area around the loop 
to form two short α-helices that act as a lid to cover the active site, which 
leads to a ‘closed’ conformation. In this region of the Synechocystis 
POR, only one longer α-helix (residues 230-239; hereafter referred to 
as helix 1) (Fig. 2a) is observed, although an additional loop (residues 
148-159; hereafter referred to as loop 1) (Fig. 2a), which extends from 
the central β-sheet, is located near helix 1. It appears that loop 1, which 
is conserved in POR enzymes (Extended Data Fig. 1e, Supplementary 
Fig. 1), and helix 1 are important in controlling the subsequent binding 
of the Pchlide substrate. 

Beyond the loop regions, our POR crystal structures identify additional 
residues that are important to coenzyme binding. The SDR family 
of proteins uses an Asn-Ser-Tyr-Lys catalytic tetrad for proton transfer 
and stabilization of reaction intermediates; however, in POR a Thr 
residue (Thr145) often replaces the Ser residue (Supplementary Fig. 1). 
Crystal structures of the binary POR-NADPH complex show that three 
of these residues (Asn90, Tyr193 and Lys197; numbering refers to the 
T. elongatus POR) are directly hydrogen-bonded to the coenzyme 
(Fig. 2b, c). Also, Arg38, Lys42 and Asp63 interact with the coenzyme 
via a hydrogen-bonding network to the 2′ phosphate of the molecule 
(Fig. 2b, c, Extended Data Fig. 4). Arg38 is implicated in coenzyme binding 
and release as it adopts alternative conformations in the different 
POR structures (Fig. 2d), a hypothesis that is further supported by the 
reduced NADPH binding affinity observed in an R38V variant. A water 
channel surrounding the Tyr193 and Lys197 residues of the active site is 
also identified in our structures, which could act as a potential proton 
relay network during catalysis (Fig. 2e). 

Despite extensive efforts, crystals of the ternary POR complex (comprising 
POR, Pchlide and NADP(H)) were not obtained, as a result of the 
enzyme adopting an array of oligomeric states on binding the substrate 
(Supplementary Fig. 2). We therefore used molecular docking to produce 
an initial model for the structure of the POR-Pchlide-NADPH ternary 
complex, followed by simulated annealing molecular-dynamics simulations 
and umbrella sampling to calculate the potential of mean force 


state interactions between the Pchlide and the protein, which lead to sequential 
hydride transfer from NADPH to the C-17 position and proton transfer to the C-18 
position. 


for Pchlide binding (Extended Data Figs. 3, 5). Although the potential of 
mean force changes over the time course of the simulation, the lowest 
energy conformation and the approximate dissociation energy remain 
stable and well-defined (Extended Data Fig. 5d, e). The resulting binding 
energy of about 35 kJ mol⁻¹ (which corresponds to a dissociation constant 
of 0.8 µM) agrees well with the experimental dissociation constant of 
about 2 µM (Extended Data Table 1b, Supplementary Fig. 3). The lowest 
energy conformation yields a ternary complex structure that is more 
rigid than the flexible, highly mobile structures that were observed for 
apo-POR and POR-NADPH (Extended Data Fig. 3a-c). In the ternary 
complex structure, helix 1 folds over the Pchlide and, together with 
loop 1, forms a lid over the active site (Fig. 3a). The hydrophobic edge of 
Pchlide points towards hydrophobic regions of the enzyme (including 
the residues in helix 1), and the keto, methylester and carboxylic acid 
groups of the Pchlide molecule are positioned in a hydrophilic pocket 
(Fig. 3b). This provides numerous hydrogen-bonding opportunities and 
positions Pchlide in an orientation that is consistent with the required 
stereochemistry of hydride and proton transfer (Fig. 3c, Extended Data 
Fig. 5). Specific interactions include a salt bridge between Lys197 and 
the carboxylic acid side chain at the C-17 position, which also forms a 
hydrogen bond to Thr145. The central Mg²⁺ ion of Pchlide is ligated to 
two water molecules, one of which appears to be part of a water network 
that also involves hydrogen bonds to Tyr223 and the methylester at the 
C-15 position. 
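
The correspondence between the computed binding energy and a dissociation constant follows from the standard relation K_d = exp(−ΔG/RT). The sketch below is only an illustration of that arithmetic: the temperature of 298.15 K and the 1 M standard state are assumptions, as the text does not state the conditions used for the conversion, and the function name is hypothetical.

```python
import math

R_GAS = 8.314462618  # molar gas constant, J mol^-1 K^-1


def kd_from_binding_energy(delta_g_kj_per_mol: float,
                           temperature_k: float = 298.15) -> float:
    """Convert a dissociation free energy (kJ/mol, positive for a bound
    state more stable than the unbound one) into a dissociation constant
    in mol/L, relative to an assumed 1 M standard state."""
    if temperature_k <= 0:
        raise ValueError("temperature must be positive")
    return math.exp(-delta_g_kj_per_mol * 1000.0 / (R_GAS * temperature_k))


if __name__ == "__main__":
    kd = kd_from_binding_energy(35.0)  # binding energy quoted in the text
    print(f"Kd = {kd * 1e6:.2f} uM")
```

Under these assumptions, a binding energy of 35 kJ mol⁻¹ gives a value a little below 1 µM, in line with the 0.8 µM quoted above and within a small factor of the experimental value of about 2 µM.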

The photocatalytic implications of the determined structures and 
models of the active ternary complex were investigated using activity, 
binding and inhibition studies of site-directed variants and analogues 
of Pchlide with modifications to substituent groups that are of 
potential importance to the enzyme mechanism (Extended Data Fig. 6, 
Extended Data Table 1b, Supplementary Figs. 3-8). Pchlide analogues 
were altered in the nature of the substituents on ring E, the central Mg²⁺ 
ion and the carboxylic acid side chain at the C-17 position, all of which 
have previously been inferred as being important in POR-catalysed 
photoreduction. An analogue in which the propionic acid side chain 
at the C-17 position is exchanged for a methylester group does not 
bind to POR. This analogue is also not a competitive inhibitor of the 
natural enzyme-catalysed reaction. This emphasizes the importance 
of this propionic acid side chain in Pchlide binding to the POR active 
site, consistent with our structural models. Mutagenesis of Lys197 or 
Fig. 2 | Crystal structures of light-dependent POR. a, Synechocystis POR 
crystal structure with NADPH bound in the active site (Supplementary 
Methods). The regions coloured orange are the missing loops in T. elongatus 
POR, which are present in the crystal structure of Synechocystis POR. 
b, Structure of the coenzyme-binding site of T. elongatus POR. Hydrogen bonds 
between key residues and NADPH are shown as red dashes. The electron density 
for NADPH (omit 2Fo − Fc map contoured at 1σ) is coloured green. c, Comparison 
of the coenzyme-binding site of the T. elongatus POR and Synechocystis POR. 
The NADPH is coloured as a green (T. elongatus) or red (Synechocystis) stick. 
d, Comparison of the conformation of Arg38 in apo- and NADPH-bound 
structures of the T. elongatus POR. The electron density for Arg38 (2Fo − Fc map 
contoured at 1σ) in the apo-POR structure is coloured blue; in the NADPH-POR 
complex, Arg38 is coloured cyan. e, Potential water channel surrounding key 
residues in NADPH-bound structure of the T. elongatus POR. Water molecules 
are shown as red balls, and hydrogen bonds between the key residues and water 
molecules are shown as red dashes. 



Thr145 leads to an impaired ability of POR to bind Pchlide substrate. 
This finding is consistent with the identified salt bridge and hydrogen 
bond made by these residues in their interaction with the propionic acid 
side chain of Pchlide. Additional Pchlide analogues modified at a range 
of locations of the porphyrin retain an ability to bind to the enzyme. 
These analogues also act as competitive inhibitors, albeit with affinities 
that are reduced compared to Pchlide itself (Extended Data Fig. 6, 
Extended Data Table 1b). Loss of the central Mg²⁺ ion and the keto group 
(C-13 position) led to about a 3-5 fold increase in Kd, as expected from 
proposed roles in binding via coordination with water molecules in 
the active site. Although Lys156 is situated in the loop region of the lid 
(in close proximity to Pchlide), activity measurements using a K156A 
variant indicate that Lys156 is not important for binding (Extended 
Data Table 1b, Supplementary Fig. 8). This is consistent with the lack 
of hydrogen-bonding from Lys156 to Pchlide in the final model of the 
ternary complex. Molecular-dynamics simulations indicate Tyr223 
facilitates the binding of Pchlide in the orientation that is required for 
catalysis via hydrogen bonding to the keto group at an intermediary 
phase in the binding process (at a Pchlide-binding coordinate of about 17 Å 
in the potential of mean force in Extended Data Figs. 5, 7) but does not 
directly interact with Pchlide in the ternary complex (Extended Data Fig. 6, 
Supplementary Video 1). This is consistent with experimental binding data, 
in which a considerable reduction of about four- and tenfold is observed in the 
binding of Pchlide to Y223F and Y223A variants of the POR, respectively 
(Extended Data Table 1b, Supplementary Fig. 8). 
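
The fold changes quoted in this paragraph can be checked against the constants in Extended Data Table 1b. The sketch below assumes a particular reading of that table (K_d of 1.9 µM for Pchlide, 9.7 µM for protopheophorbide and 6.1 µM for the C-13-OH analogue; K_m of 3.6 µM for wild type, 11.6 µM for Y223F and 33.2 µM for Y223A); the helper name and dictionary labels are hypothetical.

```python
def fold_change(value: float, reference: float) -> float:
    """Ratio of a variant or analogue constant to the reference value."""
    if reference <= 0:
        raise ValueError("reference must be positive")
    return value / reference


# Constants in micromolar, as read from Extended Data Table 1b
# (the row assignment is an assumption of this sketch).
KD_PCHLIDE = 1.9
KM_PCHLIDE = 3.6
KD_ANALOGUES = {"protopheophorbide (no Mg2+)": 9.7, "C-13-OH Pchlide": 6.1}
KM_VARIANTS = {"Y223F": 11.6, "Y223A": 33.2}

if __name__ == "__main__":
    for name, kd in KD_ANALOGUES.items():
        print(f"{name}: Kd increases {fold_change(kd, KD_PCHLIDE):.1f}-fold")
    for name, km in KM_VARIANTS.items():
        print(f"{name}: Km increases {fold_change(km, KM_PCHLIDE):.1f}-fold")
```

On this reading the analogue K_d values rise by roughly three- to fivefold, and the Y223F and Y223A constants by roughly three- and ninefold, which is broadly in line with the approximate figures given in the text.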
Fig. 3 | Structural model of the POR-Pchlide-NADPH ternary complex. a, The 
overall structure of the POR-Pchlide-NADPH ternary complex showing the 
structural changes upon Pchlide binding. The loop, helix and Pchlide molecule 
coloured in green show the Pchlide entering the binding pocket; and loop, helix 
and Pchlide molecule coloured in red show the final binding position of Pchlide. 
b, View of the Pchlide-binding pocket. The red surface represents hydrophobic 
residues and the blue surface represents hydrophilic residues. c, Structure of the 
active site of the T. elongatus POR, showing the hydrogen-bonding network 
around the Pchlide molecule. Water molecules are shown as red balls; potential 
hydrogen bonds are shown as red dashes. 


The POR crystal structures and ternary complex model provide a 
structural basis for understanding light-activated catalysis in POR. The 
extensive hydrogen-bonding network identified between active-site 
residues and Pchlide is likely to be essential for photochemistry, and these 
interactions are proposed to strengthen in the excited state to create an 
electron-deficient site at the C-17–C-18 double bond. The central Mg²⁺ 
ion and keto group at the C-13 position of the Pchlide are also important in 
Pchlide photochemistry (Extended Data Fig. 8, Supplementary Fig. 9), as 
they facilitate charge separation across the Pchlide during photoexcitation. 
The active-site architecture of POR (Extended Data Fig. 9) is finely 
tuned to facilitate this excited-state charge separation and to stabilize the 
strengthened dipole across the Pchlide molecule (Fig. 4). The positive end 
of the dipole is located within the hydrophobic pocket of the active site, 
Fig. 4 | A structural basis for light-dependent reaction chemistry in POR. 
Mechanistic scheme that illustrates the potential structural reorganization of 
the Pchlide-binding site upon excitation. 



in which it is stabilized by π–π stacking interactions with a conserved Phe 
residue. By contrast, the negative end of the dipole is found in a region of 
polar residues. Interactions between Thr145 and Lys197 of POR and the 
propionic acid side chain of Pchlide at C-17 are important to excited-state 
chemistry, as shown by the fact that changes to either residue result in 
impaired photochemistry. These excited-state interactions between 
Pchlide and POR stabilize a highly polarized C-17–C-18 double bond and 
enable stepwise hydride transfer from NADPH to C-17. The polarized 
nature of the C-17–C-18 bond may also be stabilized by the close proximity 
of the hydroxyl group of Tyr193, which is known to be required for Pchlide 
photochemistry. By contrast, Tyr223 does not form any direct interactions 
with Pchlide (Extended Data Fig. 6, Extended Data Table 1b) and, 
consistent with this finding, the POR variants Y223A and Y223F are able to 
catalyse hydride transfer from NADPH to Pchlide (Supplementary Figs. 10, 
11). The donor-acceptor distance between the NADPH pro-S hydrogen 
and Pchlide C-17 is 4.5 ± 0.3 Å (Extended Data Fig. 6b), but this may change 
slightly upon photoexcitation of the Pchlide molecule. The subsequent 
transfer of a second proton to the C-18 of Pchlide is from the strictly 
conserved Tyr193 of POR (Extended Data Fig. 6c) with a donor-acceptor 
distance of 4.9 ± 0.4 Å in our model, although this distance may change 
upon formation of a Pchlide anion following reduction of the C-17–C-18 
double bond. In reactions catalysed by other members of the SDR family, 
the catalytic Tyr is replenished through a proton relay mechanism. 
On the basis of the crystal structures reported here, the adjacent Lys197 
is implicated in modulating the ionization properties of Tyr193. As the 
Y223A and Y223F variants of POR possess reduced rates of proton transfer 
(Supplementary Fig. 11), the water network coordinated by Tyr223 may 
also be important in this proton relay mechanism. The crystal structure 
of POR reported here should enable computational and time-resolved 
structural-mechanistic studies of the complete enzyme reaction cycle 
to provide spatial, temporal and energetic understanding across multiple 
timescales (for example, femtosecond-to-second), and thereby address 
a major challenge in biological catalysis. 


Reporting summary 


Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
Extended Data Fig. 1 | Crystal structures of the T. elongatus and 
Synechocystis PORs. a, T. elongatus apo-POR. b, T. elongatus POR with bound 
NADPH. c, Synechocystis POR with bound NADPH. The protein structure is 
coloured according to secondary structure: cyan, α-helix; magenta, β-sheet; 
green, loop. d, Alignment of the POR crystal structures that we solve here. 
Magenta, apo structure of T. elongatus POR; yellow, structure of NADPH-bound 


27. Ashkenazy, H. et al. ConSurf 2016: an improved methodology to estimate and visualize 
evolutionary conservation in macromolecules. Nucleic Acids Res. 44, W344-W350 (2016). 



T. elongatus POR; blue, structure of NADPH-bound Synechocystis POR; orange, 
missing loops in T. elongatus POR (which are present in the crystal structure of 
the Synechocystis POR). e, Evolutionary conservation of Synechocystis POR 
structure. Evolutionary conservation of amino acid positions in the 
Synechocystis POR protein was estimated on the basis of the phylogenetic 
relations between homologous sequences, using the online ConSurf server (ref. 27). 
Extended Data Fig. 2 | Comparison of the Synechocystis POR structure with that 
of the SDR-family protein β-ketoacyl reductase (PDB code 2B4Q). a, Overall 
structure of β-ketoacyl reductase protein (coloured by secondary structure). 
b, Alignment of the Synechocystis POR (blue) and β-ketoacyl reductase (orange). 
NADPH is shown as a red stick. 
Extended Data Fig. 3 | Molecular-dynamics simulations of the T. elongatus 
POR. a-c, Overlays of structures from unrestrained molecular-dynamics 
simulations at 500-ps intervals for apo enzyme (a), POR-NADPH complex (b) 
and the ternary complex (c). d, Root-mean-squared deviation (RMSD) versus 
time (top) and the per-residue root-mean-squared fluctuation (RMSF) (bottom), 
calculated for the non-hydrogen atoms of the protein. e, Distance distributions 
for the Pchlide-binding coordinate Rp, and the distances between the hydride 
donor and acceptor atoms (R,) and the proton donor and acceptor atoms (R,). 
Extended Data Fig. 4 | Electron density map for NADPH bound to the 
T. elongatus and Synechocystis PORs. a, NADPH anaerobic soaking with the 
T. elongatus POR crystal. A low-resolution 3.5 Å electron density map 
(field-emission microscopy) contoured at 1σ (green mesh) along with an Fo − Fc omit 
map contoured at 4σ (magenta mesh) is shown for the NADPH region of an 
anaerobically soaked T. elongatus POR crystal. An all-atom coloured stick 
representation of the aerobically soaked high-resolution NADPH-POR structure 
and associated active-site interactions are shown along with a stick 
representation of the anaerobically soaked NADPH in green. b, Structure of the 
NADPH-binding site of the Synechocystis POR. Hydrogen bonds between key 
residues and NADPH are shown as red dashes. The electron density for NADPH 
(omit Fo − Fc map contoured at 3σ) is coloured green. 
Extended Data Fig. 5 | Modelling of the T. elongatus POR-Pchlide-NADPH 
ternary complex. a-c, Three stages of modelling the T. elongatus POR-Pchlide-NADPH 
ternary complex. The structures are aligned and superimposed. Each 
panel highlights one of the three structures in blue (with the other two shown in 
grey): crystal structure with the chosen docked Pchlide structure (a); the 
structure after 20-ns annealing (b); and the final representative structure (c). 
The flexible residues during docking are shown as a yellow ball-and-stick 
representation, and the Pchlide molecule is shown in green stick representation. 
d, Potential of mean force (PMF) calculated by umbrella sampling as a function 
of Ry (Supplementary Methods) for 50-ns umbrella sampling (black), as well as 
for the first 20 ns (blue) and the last 20 ns (red). e, Population distributions for 
each bin, each sampled for 50 ns. f, Potential of mean force was calculated by 
umbrella sampling as a function of R, (distance between Pchlide Mg²⁺ and lower 
edge of the POR binding pocket) (Supplementary Methods). The dip in the 
potential of mean force at an R, of about 17 Å corresponds to the formation of a 
hydrogen bond between Y223 and the Pchlide keto group. 
Extended Data Fig. 6 | Configuration of the active site and donor-acceptor 
distance of the T. elongatus POR-NADPH-Pchlide ternary complex model. 
a, Active site of the T. elongatus POR-NADPH-Pchlide ternary complex. The 
hydrogen-bonding network around the Pchlide and NADPH molecules is shown 
as red dashes. The donor-acceptor distance for hydride and proton transfer is 
shown as blue dashes. Water molecules are shown as red balls. b, View of the 
active site of the T. elongatus POR, highlighting the donor-acceptor distance for 
hydride transfer (shown as a blue dashed line). c, View of the active site of the 
T. elongatus POR, highlighting the donor-acceptor distance for proton transfer 
(shown as a blue dashed line). d, Summary of the activity, binding and inhibition 
data for the Pchlide analogues. The structures and apparent kcat, Km, Kd and Ki 
(where applicable, in each case) are shown for Pchlide (I), protopheophorbide 
(II), Pchlide with a C-17 methylester (III), Pchlide with a C-13-OH (IV) and Pchlide 
with a C-13 and C-15 methylester (V). The red circles show the regions of the 
Pchlide molecule that have previously been shown to be important for activity 
(central Mg²⁺, ring E and the side chain at the C-17 position). The structures of all 
of the Pchlide derivatives described in the present study are shown with the 
modifications indicated by dashed red circles. 
Extended Data Fig. 7 | Hydrogen-bonding interactions between Tyr223 and 
the C-13 keto group during Pchlide binding. a-g, Change in hydrogen bonding 
between Tyr223 and Pchlide C-13 keto group during Pchlide binding. Plots of the 
distance Ry between Pchlide Mg” and base of POR binding pocket (black), and 
the distance R,,, (HB, hydrogen bonding) between the Tyr223 hydroxy proton 
and keto oxygen (blue) from 20-ns molecular dynamics simulations at 
increasing R, values. h, Average Ry, for each 0.5 A bin for Ro. i, Extended 100-ns 


molecular-dynamics simulation with Ry restrained at 1.7 Å to further illustrate 
the stability of the hydrogen bond between Y223 and the keto group. j-m, As the 
Pchlide leaves the binding pocket (shown sequentially from j to m), the Tyr223 
residue is free to move around and form a transient hydrogen bond with the C-13 
keto group of Pchlide to ‘guide’ the Pchlide into its final orientation. The protein 
backbone is shown as a grey cartoon. At each stage, the Pchlide and Y223 
molecules have been highlighted with thicker sticks in the figure. 
Extended Data Fig. 8 | Evolution associated difference spectra that result 
from global analysis using a sequential model of visible transient 
absorption data collected between 0.6 ns and 2.7 ps. a-h, Evolution 
associated difference spectra (EADS) are shown for Pchlide (a), POR with NADPH 
and Pchlide (b), protopheophorbide (c), POR with NADPH and 
protopheophorbide (d), Pchlide with a C-13 and C-15 methylester (e), POR with 
NADPH, and Pchlide with a C-13 and C-15 methylester (f), Pchlide with a C-13-OH 
(g), POR with NADPH, and Pchlide with a C-13-OH (h). All data could be fitted 
using two EADS, except for POR with NADPH and Pchlide, which required three 
EADS owing to the formation of the hydride transfer intermediate (spectrum in 
red in b). The absence of any additional intermediates for the Pchlide analogues 
in the presence of POR implies impaired photochemistry. 
Extended Data Fig. 9 | Potential interactions in the T. elongatus POR ternary 
complex model. a, The Pchlide molecule was chosen as the ligand, and the POR 
protein and solvent were chosen as the receptor. The 2D interaction map was 
calculated through Molecular Operating Environment software (Chemical 
Computing Group). b, The NADPH molecule was chosen as the ligand, and the 
POR protein and solvent were chosen as the receptor. The 2D interaction map 
was calculated through Molecular Operating Environment software (Chemical 
Computing Group). 


Extended Data Table 1 | Data collection and kinetic parameters of POR 

a 
                        TePOR*               TePOR-NADPH          SsPOR-NADPH 
Data collection 
Space group             P3,21                P3,21                P2,221 
Cell dimensions 
  a, b, c (Å)           66.85 66.85 133.01   66.99 66.99 131.97   56.96 72.89 155.99 
  α, β, γ (°)           90 90 120            90 90 120            90 90 90 
Resolution (Å)          53.08 - 1.27         43.57 - 2.1          28.57 - 1.87 
                        (1.315 - 1.27)**     (2.175 - 2.1)        (1.91 - 1.87) 
Rmerge                  0.0928 (0.8745)      0.1092 (0.7126)      0.097 (0.881) 
I/σI                    14.73 (2.36)         12.60 (3.18)         11.80 (2.1) 
Completeness (%)        98.76 (96.46)        99.96 (100.00)       94.90 (86.50) 
Redundancy              17.5 (12.1)          9.2 (8.9)            5.6 (6.0) 
Refinement 
Resolution (Å)          53.08 - 1.27         43.57 - 2.1          28.57 - 1.87 
                        (1.315 - 1.27)       (2.175 - 2.1)        (1.91 - 1.87) 
No. reflections         91463 (9068)         20674 (2023)         51836 (3223) 
Rwork / Rfree           0.1526 / 0.1858      0.1672 / 0.1997      0.170 / 0.179 
No. atoms 
  Protein               2309                 2224                 4761 
  Ligand/ion            2                    48                   97 
  Water                 363                  158                  416 
B-factors (Å²) 
  Protein               19.09                37.01                23.2 
  Ligand/ion            21.17                50.22                15.2 
  Water                 34.55                41.16                32.4 
R.m.s. deviations 
  Bond lengths (Å)      0.010                0.007                0.019 
  Bond angles (°)       1.17                 0.7                  1.472 

b 
Pchlide and its analogues        kcat            Km/Ki Pchlide†   Kd Pchlide    khydride       kproton 
or TePOR variants                (s⁻¹)           (µM)             (µM)          (×10⁶ s⁻¹)     (×10⁴ s⁻¹) 
Pchlide (I)                      0.023 ± 0.001   3.6 ± 0.4        1.9 ± 0.7     2.36 ± 0.11    2.23 ± 0.05 
Protopheophorbide (II)           NA              8.5 ± 0.3        9.7 ± 3.5     NA             NA 
C17 methylester Pchlide (III)    NA              NA               NA            NA             NA 
C13OH Pchlide (IV)               NA              9.7 ± 1.1        6.1 ± 1.2     NA             NA 
C13C15 methylester Pchlide (V)   NA              50.8 ± 3.6       46.7 ± 4.6    NA             NA 
K156A                            0.017 ± 0.001   3.11 ± 0.55      NA            2.40 ± 0.03    2.26 ± 0.14 
Y223A                            0.020 ± 0.004   33.2 ± 10.2      NA            1.77 ± 0.42    2.39 ± 0.46 
Y223F                            0.017 ± 0.001   11.6 ± 1.98      NA            1.80 ± 0.15    0.93 ± 0.07 

a, Data collection and refinement statistics (molecular replacement). *A single crystal was used for each data collection. **Values in parentheses are for the highest-resolution shell. b, Kinetic 
parameters of the T. elongatus POR substrate analogues and variants. †Ki values were measured for non-active substrate analogues. NA, not applicable for non-active substrate analogues. The 
values shown in the table are mean ± s.e.m., n = 3 for steady-state kinetics measurements, n = 5 for hydride and proton transfer measurements. 
Prostate cancer: A declining art


Brachytherapy is an established treatment for prostate cancer with much to 
recommend it, but it is losing ground to flashier therapies. By Michael Eisenstein 


Modern medicine advances so quickly that it might be surprising to
learn that a 100-year-old treatment for prostate cancer is still
relevant. Brachytherapy, which involves bombarding the tumour with
radiation from isotopes positioned around it, has a track record as a
safe and effective procedure, and is less costly than other
interventions such as robot-assisted surgery. Nevertheless, many
oncologists are concerned that the technique could fall out of use.

In the first demonstration of brachytherapy in 1911, French physician
Octave Pasteau used a urethral catheter containing radium. The
technique evolved, and by the 1980s, a version known as low-dose-rate
(LDR) brachytherapy, which is still in use today, had emerged. The
therapy involves injecting ‘seeds’ of radioactive iodine or palladium
into the gland, with the help of ultrasound imaging. These seeds are
permanently embedded, releasing radiation for several months.

The process poses some problems: implanted seeds can expose patients’
sexual partners to radiation, and the seeds can migrate into healthy
tissues over time, for example. Another iteration of the treatment,
known as high-dose-rate (HDR) brachytherapy, remedies this by
temporarily introducing iridium isotopes into the prostate inside
catheters.

LDR brachytherapy has long been used to treat people with prostate
tumours, and the clinical performance of the HDR variety is promising.
Both are delivered alone or alongside other treatments. But the use of
both forms is in decline. In 2002, 17% of people in the United States
with prostate cancer received the treatment; by 2010, that number had
fallen to just 8% (J. M. Martin et al. Cancer 120, 2114–2121; 2014).

In part, this decline can be ascribed to the fact that aggressive
treatment by any method has become less common: many clinicians now
opt instead to keep a close eye on low-risk tumours. But brachytherapy
is also being eclipsed by more technologically sophisticated treatments
such as robot-assisted surgery and proton therapy, a shift partly
facilitated by hospital-reimbursement policies that favour newer
approaches. The fall has alarmed many oncologists and radiotherapists,
with some suggesting that it could lead to a decline in cure rates.

Without action, brachytherapy’s decline in use seems set to continue.
The drop means fewer opportunities for medical students and residents
to see the technique in action, a feedback loop that, according to a
survey of US radiation-oncology residents, might already be affecting
their familiarity with the technique (N. Nabavizadeh et al. Int. J.
Radiat. Oncol. Biol. Phys. 94, 228–234; 2016). But the
radiation-oncology community is taking steps to ensure that
brachytherapy remains an option. The American Brachytherapy Society in
Reston, Virginia, has embarked on an initiative to train 30
practitioners in the technique every year for the next decade. And the
American Society for Radiation Oncology is lobbying the US Centers for
Medicare and Medicaid Services to re-evaluate how it reimburses
hospitals for certain treatments. Brachytherapy has a long history, and
many practitioners think that if it disappears, it is patients who will
lose out.
KEEPING TREATMENT OPTIONS OPEN


People with prostate cancer currently have several treatment options available to them. But one of the oldest, brachytherapy, 
is losing popularity with physicians. Without action, the skills needed to perform this effective therapy could be lost. 
By Michael Eisenstein; infographic by Lucy Reading-Ikkanda 


OLD THERAPY, NEW APPROACH 


Brachytherapy uses radioactive material to kill 
cancerous cells. The conventional approach 
involves permanently implanting radioactive 
pellets in and around the prostate. In a newer 
form of brachytherapy, radioactive material is 
introduced temporarily through removable 
catheters. This might improve safety and 
efficacy, although only limited clinical data are 
available. 
Low-dose-rate brachytherapy

… into the prostate and surrounding tissue. The placement of the
iodine-125 or palladium-103 seeds is permanent, and cannot be altered
after injection.

Dose: Low-energy isotope seeds emit radiation over an extended period.

Facilities: Outpatient procedure with no specialized facilities.

Administration: Permanent implantation means greater care is needed
with seed placement.

Evidence: Decades of clinical data support low-dose-rate (LDR)
brachytherapy as a stand-alone and adjunct treatment.

Risk potential: Seeds can migrate to other tissues. Sexual partners
might be exposed to radiation until the isotope decays.


A QUESTION OF RISK


Brachytherapy is one of several treatment options for prostate cancer. Treatment suitability
depends on a tumour’s level of risk, determined on the basis of factors such as size and cellular
structure. Brachytherapy can be used alone for lower-risk tumours, or to boost more aggressive
radiotherapy or hormone therapy in higher-risk cases.
Active surveillance

Many clinicians now opt for careful, regular monitoring, rather than
immediate treatment, for people with low-risk, slow-growing tumours.

Beam therapy

For higher-risk disease, people are typically offered a more aggressive
radiotherapy course, such as external-beam radiation therapy (EBRT).

Radical prostatectomy

The prostate and seminal vesicles, which also secrete fluid found in
semen, can be surgically removed to excise the tumour. This type of
surgery is often robot-assisted for greater precision.

Hormone therapy

Tumour growth can be stalled by reducing levels of hormones such as
testosterone through drugs or surgical castration.


[Chart: treatment options shown against tumour risk. Labels: active
surveillance; EBRT; brachytherapy; radical prostatectomy; EBRT with
brachytherapy; EBRT with hormone therapy; EBRT with brachytherapy and
hormone therapy.]


High-dose-rate brachytherapy

Soft catheters are inserted into affected areas of the prostate, and
radioactive iridium-192 sources are placed in the catheters for each
treatment session, which lasts around 2 hours. Multiple treatments are
given over 1–2 days in hospital.

Dose: High-energy isotopes are applied in multiple, brief sessions.

Facilities: Specialized equipment and a shielded, radiation-safe site.

Administration: Isotope positioning can be adjusted between treatment
sessions.

Evidence: Robust comparisons of high-dose-rate (HDR) brachytherapy with
other therapies are still needed.

Risk potential: No radioactive material is left in the body after
treatment, reducing the risk of problems later.


Watch an animation at go.nature.com/2vkgz7j
A VALUABLE OPTION


Randomized controlled trials have shown that brachytherapy has benefits
similar to those of other treatments, but can be delivered at a lower cost
and, in many cases, with more easily managed side effects.


Survival gains 


In people with low-risk tumours, brachytherapy is as effective (96%) as robotic
surgery (97%) at preventing cancer recurrence within two years². For high-risk
disease, combining brachytherapy with EBRT boosts progression-free survival
compared with EBRT alone³.
Cost-effective


The cost of brachytherapy in US hospitals compares favourably with that of
cutting-edge treatments such as advanced forms of beam therapy and
robot-assisted surgery⁴. In Europe, treatment costs are more closely matched.


[Chart: cost per patient after five years of follow-up (US$, thousands),
for active surveillance, LDR and HDR brachytherapy, intensity-modulated
radiation therapy (beam therapy) and robotic surgery.]

Side-effect selection


Compared with surgery, brachytherapy tends to carry a lower risk of causing
sexual dysfunction and incontinence. But it is associated with greater rates of
urinary-tract obstruction than are other treatments⁵.
SLIDE AWAY


Despite its apparent benefits, brachytherapy is struggling to compete with a 
growing clinical armamentarium. Worldwide, dwindling training 
opportunities put the future of this technique in jeopardy. 


Decreasing use


The proportion of US medical facilities performing brachytherapy has been
falling each year since 2007, partly because clinicians are delaying
treatment for low-risk malignancies in favour of active surveillance. But
brachytherapy’s use is also sliding compared with EBRT and surgery⁶.

[Chart legend: radical prostatectomy, EBRT, brachytherapy, surveillance.]
Funding issues


In the United States, reimbursement rates by health insurer Medicare for
brachytherapy (profit for LDR shown) are closer to the true cost than for
other, more costly therapies, such as intensity-modulated radiation therapy
(IMRT). This means the other therapies are more profitable⁷. In Europe,
the […] always cover the cost of brachytherapy.

[Figure: profit per patient to hospital (US$). Recoverable labels: Medicare
reimbursement, $3,100; cost to treat patient, $2,700; Medicare
reimbursement, $9,700.]


Opportunity cost 


As brachytherapy’s popularity declines, so do the opportunities for clinicians
early in their career to observe and learn the technique. US medical residents
increasingly report insufficient opportunity for training in brachytherapy
relative to newer therapies⁸. This could lead to further reductions in access.


[Chart: adequate training in technique. US medical residents reporting
adequate exposure in 2014 (%), with bars labelled ‘brachytherapy’ and
‘HDR brachytherapy’.]


SOURCES 1. GLOBOCAN 2018. 2. G. Claudio et al. Can. J. Urol. 24, 8728-8733 
(2017). 3. W. J. Morris et al. Int. J. Rad. Oncol. 98, 275-285 (2017). 4. A. A. 
Laviana et al. Cancer 122, 447-455 (2016). 5. R. C. Chen et al. J. Am. Med. 
Assoc. 317, 1141-1150 (2017). 6. National Cancer Data Base. 7. S. W. Dutta et 
al. Brachytherapy 17, 556-563 (2018). 8. N. Nabavizadeh et al. Int. J. Radiat. 
Oncol. Biol. Phys. 94, 228-234 (2016). 


Nature | Vol 574 | 31 October 2019 | S83


© 2019 Springer Nature Limited. All rights reserved. 


SINO IMAGES/GETTY 


Advice, technology and tools 


Work 


Send your careers story to: naturecareerseditor@nature.com


TAKING RESEARCH FROM DESIGN TO LIFT-OFF


If you want an astronaut to run your 
experiment in outer space, you have 
to keep it simple. By Brian Owens 


On 25 July, an experiment that Charles Cockell had spent years planning
blasted into space. A SpaceX rocket launched from Florida, heading to
the International Space Station (ISS). It carried 18 bioreactors, each
the size of a deck of cards, that would be used to study whether
bacteria could mine useful minerals on the Moon, Mars or asteroids.

Getting the experiment off the ground (literally) was “one of the most
exciting things I’ve experienced”, says Cockell, an astrobiologist at
the University of Edinburgh, UK. But the process of getting from
proposal to lift-off was long and involved: Cockell’s biomining
experiment was more than 11 years in the making.

There are various routes to the ISS, but most go through one of the
five space agencies that support the space station: those in Canada,
Japan, Russia and the United States, and the 11-nation European Space
Agency (ESA). These agencies periodically release calls for research
proposals that help to meet their space-science goals.
“Most of our research has an eye toward enabling exploration, but it
can also have terrestrial benefits,” says Craig Kundrot, director of
NASA’s Division of Space Life and Physical Sciences Research and
Application. The US portion of the ISS has been designated a national
laboratory, meaning that it’s available for any research that would
benefit from access to space, even if the project is not aimed at
advancing space exploration.

Cockell’s experiment is intended to study how future Moon or Mars
missions could use bacteria to extract materials from rocks in space,
including minerals and metals for construction, water for rocket fuel,
and soil.

Around 600 experiments are conducted each year on the station, which
has extremely limited space, power and time allocated to astronauts to
work on experiments. The payload specialists for research on the ISS,
those who plan what to send and when, have to juggle all of these
factors as they decide which experiments can go when, and how these
will all fit together. “They’re playing Tetris on a daily basis,” says
Kundrot.

Getting chosen by a space agency is just the start of the process. Then
comes the tricky business of designing an experiment that can be packed
into a rocket, blasted to the station and conducted by an astronaut who
has hundreds of other responsibilities each day. It must also survive
re-entry into the atmosphere and recovery, sometimes from hard-to-reach
places such as the middle of the Pacific Ocean. Luckily, there are
teams of engineering contractors and payload specialists that take the
lead on making each experiment function within the limits on mass, size
and power consumption.

NASA astronauts Christina Koch and Andrew Morgan stow biological research samples in a science freezer on the International Space Station.

“As a scientist, you’re not responsible for getting it to work. There’s
a whole team who support the technical aspects,” says Monica Driscoll,
a molecular biologist at Rutgers University in Piscataway, New Jersey.
She was involved in a project running from December 2018 to January
2019 that used the nematode worm Caenorhabditis elegans to study the
neurological effects of space flight.

Depending on the experiment, the engineering challenge can be
relatively simple or fiendishly complicated. Physical-science
endeavours tend to take longer to plan because they need bespoke
equipment, whereas those in the biological sciences can often repurpose
gear that worked in other projects. Driscoll had the benefit of drawing
on the experience of previous work on C. elegans in space, but
Cockell’s bioreactors had to be designed from scratch, which is part of
the reason for the long timeline.

On the experimental design side, the challenge is to keep the project
as simple as possible while getting useful results. Something as basic
as using a freezer, which wouldn’t require a second thought in a lab on
Earth, can add another layer of complexity. The freezer on the ISS,
which is designed to operate in microgravity, has a smaller storage
capacity than a freezer in a domestic refrigerator, limiting the number
of experiments that can use it. It has a waiting list, says Kundrot.
Plus, you have to work out how to keep the samples frozen during
re-entry and recovery. That’s why Cockell’s team decided to forgo
freezing its samples and instead opted to keep them in cooling packs
for the return to Earth.

“Even simple things are complicated,” says Cockell. “If you want an
astronaut to turn a knob, it has to be in their schedule.”

The experiment then undergoes several dry runs on Earth: one to verify
that the science will work (for example, that Cockell’s bacteria were
able to grow in the bioreactors), and then another, more detailed
run-through of the operation of the experiment as if it was actually on
the station. Once it has passed those tests to the satisfaction of the
researchers, the space agency and the engineering contractors, the
experiment goes to a launch site, such as the Kennedy Space Center in
Merritt Island, Florida, and is loaded onto a rocket. That, says
Cockell, is when the excitement of what you are doing really hits home,
as your work is readied to leave the planet.

“The station is not actually that far away: just 400 kilometres up,
less than the distance from Edinburgh to London,” says Cockell. “But
it’s the frontier of physical difficulty.”

Cockell’s experiment is now back on Earth, and he and his team are
getting started on analysing the results. But they are already planning
more work to send back to space. In two years, they will conduct
another version of the experiment, this time bypassing the space
agencies and using commercial contractors: Kayser Space in Didcot, UK,
and its sister company Kayser Italia near Livorno, Italy. For £170,000
(US$207,000), the researchers have bought access to the ISS for their
experiments, without having to rely on the ESA peer reviewers to
approve their work. Cockell says that it’s “just like buying a plane
ticket to do research in another country”. Opportunities like this are
opening up space research in a way that would not have been conceivable
just a few years ago, he says. “It’s exciting for future generations.
Within the next few years, this is something that a lot more people
will be able to do.”


Brian Owens is a freelance science journalist 
based in St. Stephen, New Brunswick, Canada. 
As part of my work as a statistician, I research virtual habitats,
including models of areas likely to be inhabited by jaguars. Here, I’m
on Lake Imiria in Peru, in a wooden canoe made by Indigenous Shipibo
villagers. Our team went out early in the morning to spot jaguar prey
such as capybaras, peccaries and turtles, and to search for jaguars, or
at least try to detect their calls or footprints.

It looked so serene with the huge trees and calm water, but the air was
a cacophony of bird calls and mosquitoes, and was sometimes punctuated
by shouts from team members when they glimpsed a sloth or a caiman.
Boats are the best way to get around because the forest is very hot,
very dense and potentially dangerous.

Because jaguars are rare and elusive, there are very few recorded
observations of them. So my team at Queensland University of Technology
in Brisbane, Australia, uses virtual reality (VR) to help understand
them. We take photographs of selected sites that jaguars might inhabit
and turn these photos into VR scenes. Then, instead of taking jaguar
experts to the jungle, we take the jungle to the experts. These are a
mix of local Indigenous people, applying their knowledge of the region,
and international experts. We immerse these specialists in different
locations in our virtual jungle and ask them: “How likely is a jaguar
to live, move through or hunt in this area?”

This immersive environment helps people 
to recall and identify important details that 
we need to build our statistical models. 
These predict where jaguars are most 
likely to roam, and are being used to guide 
conservationists in Peru who are building a 
corridor between protected areas. 

For example, when the local people used 
our VR headset, they told us about the 
importance of specific fruit trees that the 
jaguars’ prey rely on. I think of this human 
knowledge as data that are hiding in these 
experts’ brains. The only way to tap those 
data is to put the experts right there, in the 
jungle scene. 


Kerrie Mengersen is a statistician at 
Queensland University of Technology in 
Brisbane. Interview by Kendall Powell. 


